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Progress  Report  for  the  SPRING  Project  on  MECHANICAL  ENERGY 
ON  DEMAND  FROM  HIGH  STRAIN  ACTUA  TORS 
(October  2006  -  May  2008) 

Principal  Investigator:  Ray  H.  Baughman 
Co-Principal  Investigator:  Richard  Vaia 

I  INTRODUCTION 

Specific  needs  for  the  Department  of  Defense  for  Energy  on  Demand, 
include  small  volume,  low  weight,  energy-efficient  actuators  that  provide  very 
large  strokes  and  very  high  power  densities.  These  are  required  for  such  diverse 
uses  as  deployable  structures,  morphing  wings  on  aircraft,  micro-air  vehicles, 
exoskeletons  for  future  soldiers,  autonomous  robots,  and  smart  surfaces  to 
increase  the  efficiency  and  performance  of  aircraft  and  marine  vehicles.  The 
program  plan  goal  is  to  demonstrate  both  fundamentally  new  and  dramatically 
improved  mechanisms  for  actuation. 

The  greatest  program  success  was  in  the  demonstration  of  two 
mechanisms  for  achieving  giant  stroke  actuation  of  nanotube  sheets.  These 
mechanisms  require  no  electrolyte,  and  provide  actuator  strokes  of  up  to  500% 
and  response  rates  of  up  to  104%/s  for  temperatures  from  below  77  K  to  above 
1900  K.  A  patent  application  will  be  filed  on  this  broad  technology  and  a 
manuscript  is  being  prepared  for  publication  in  Science. 

Work  was  conducted  in  another  exciting  proposed  technology  for  giant 
stroke  actuation,  though  success  has  not  yet  been  achieved.  This  second 
technology,  non-thermal  electrical  shape  memory  actuation,  is  revolutionary  in 
concept  and  performance;  electrochemical  double-layer  charge  injection  in  a 
high-surface-area  shape  memory  metal  causes  a  shift  in  the  martensitic  phase 
transition  -  thereby  providing  about  5%  strain  at  applied  voltages  of  a  few  volts 
and  over  100%  strains  at  these  applied  voltages  for  shape  memory  wire  springs. 
While  ordinary  shape  memory  actuators  have  giant  power  densities,  far  above 
other  artificial  muscles,  they  are  thermal  engines.  Hence,  their  efficiency  is 
limited  by  the  Carnot  efficiency  of  a  heat  engine. 

The  third  investigated  technology  ( low  voltage  elastomer  actuators)  is 
evolutionary  in  terms  of  mechanism,  but  revolutionary  in  terms  of  performance. 
The  elastomer-based  artificial  muscle  technology  developed  by  Stanford 
Research  Institute  and  Artificial  Muscles,  Inc.  provides  high  efficiencies  and  high 
power  densities,  but  requires  the  use  of  several  thousand  volts  for  actuation.  We 
can  decrease  this  voltage  by  up  to  two  orders  of  magnitude  by  replacing  the 
current  thick  conducting  grease  electrodes  with  50  nm  thick,  highly  elastomeric 
nanotube  electrodes  invented  at  UTD. 

Fundamentally  new  types  of  fuel-powered  artificial  muscles,  which  result  in 
electrical  dopant  intercalation  in  conducting  polymers,  were  invented,  and  a 
corresponding  international  patent  application  was  filed. 


II.  PROGRAM  PROGRESS 


A.  GIANT  STROKE  AND  RATE  CARBON  NANOTUBE  SHEET  ACTUATORS 
We  have  discovered  that  solid-state  fabricated  carbon  nanotube  aerogels  sheets 
provide  an  unusual  state  of  matter,  which  behaves  as  a  low  modulus  enthalpic 
rubber  for  stretch  in  the  width  direction  and  has  higher  gravimetric  strength  than 
steel  plate  for  stretch  in  the  length  direction,  which  is  the  nanotube  orientation 
direction.  More  importantly  for  the  present  program,  we  have  discovered  that 
these  aerogel  sheets  can  be  electrically  operated  in  air  as  artificial  muscles  that 
provide  strokes  and  stroke  rates  as  high  as  500%  and  104%/s,  respectively. 
Charge-driven,  current-driven,  and  both  dc  and  resonant  linear  actuation  were 
demonstrated,  which  utilize  voltages  ranging  from  volts  to  kilovolts,  and  produce 
either  electrically  driven  expansion  or  contraction  from  below  77  K  to  above  1900 
K.  Actuation  can  decrease  aerogel  density  to  below  10‘4  g/cm3  which  is  ten  times 
lower  than  the  previous  record  for  a  self-supporting  aerogel.  The  actuated  state 
of  the  nanotube  sheets  can  be  permanently  frozen,  which  enables  tuning 
nanotube  structure  for  device  applications,  such  as  for  solar  cell  electrodes. 
Observed  Poisson’s  ratios  (~9)  are  ~30  times  usual  values,  which  causes 
confinement  of  muscle  length  to  profoundly  effect  lateral  actuation. 


B.  NON-THERMAL  ELECTRICAL  SHAPE-MEMORY  ACTUATION 

Development  of  giant  surface  area  shape  memory  materials  is  the  key 
challenge  for  demonstrating  this  fundamentally  new  high-stroke  actuator 
mechanism,  which  is  not  limited  by  either  the  Carnot  efficiency  constraints  of 
thermally  actuated  shape  memory  materials  or  the  low  actuator  strains  of 
magnetically  operated  shape  memory  materials.  This  challenge  is  not  yet  solved. 
However,  initial  research  progress  has  been  made  on  developing  new  methods 
for  the  production  of  the  required  giant  surface  area  shape  memory  materials. 
The  first  new  method  is  by  electrochemical  growth  of  50  nm  or  smaller  diameter 
shape  memory  wires  that  are  over  50  microns  long  via  electrodeposition  using 
MHz  square  wave  pulses.  Norm  Barisci  of  our  NanoTech  Institute  worked  with 
Brett  Fanders  group  of  Oklahoma  State  University  (who  is  an  expert  on  this 
nanowire  fabrication  route),  and  a  joint  paper  on  achieved  results  for  conducting 
polymer  nanowires  will  be  submitted.  The  second  method  starts  with  25  micron 
diameter  NiTi  wires  and  dramatically  decreases  wire  diameter  (or  thickness  in  the 
thinnest  direction)  by  a  combination  of  mechanical  draw,  mechanical  conversion 
of  the  shape  memory  wire  to  a  ribbon,  and  electrochemical  erosion.  Other 
methods  are  also  being  pursued,  like  deposition  of  10  nm  thick  shape  memory 
films  by  sputtering  (followed  by  substrate  removal  in  selected  area)  to  provide  a 
micro  actuator  having  giant  surface-to-volume  ratio. 


C.  LOW  VOLTAGE  ELASTOMER  ACTUATORS 

Our  collaborative  program  work  with  Artificial  Muscles,  Inc.  has  provided 
significant  new  results  on  the  application  of  highly  elastomeric  carbon  nanotube 
sheets  as  electrodes  for  low  voltage  elastomer  electrodes.  First,  it  is  important 
that  the  elastomeric  carbon  nanotube  electrodes  do  not  significantly  hinder 
actuation  of  the  dielectric  polymer  sheets.  Measurement  results  showed  that  this 
was  the  case  for  our  water-densified,  elastomeric  nanotube  sheet  electrodes  that 
provided  below  700  ohm/sq  sheet  resistance.  Second,  it  is  important  that  the 
nanotube  sheet  electrodes  become  resistive  in  the  vicinity  of  pinholes  in  the 
dielectric  polymer  (so  that  catastrophic  failure  by  electrical  breakdown  does  not 
occur).  Unfortunately,  we  find  that  MWNTs  did  not  provide  this  self-healing 
feature.  This  problem  is  that  the  multi-walled  nanotubes  (MWNTs)  in  our 
elastomeric  sheets  do  not  vaporize  -  thereby  interrupting  current  flow  at  pin 
holes.  The  problem  fix  that  we  will  use  is  to  deploy  either  single  walled  nanotubes 
(SWNTs)  of  few  walled  nanotubes  (FWNTs)  in  our  solid-state  fabricated  sheets. 
Extension  of  our  sheet  fabrication  process  to  SWNTs  and  FWNTs  is  in  progress. 


III.  AWARDS  IN  PROGRAM  AREA  SINCE  PROGRAM  START 

1 .  Nano50  Award  of  Nanotech  Briefs  Magazine  for  Fuel  Powered  Artificial 
Muscles  (2007) 

2.  Kapitza  Metal  of  the  Russian  Academy  of  Natural  Sciences  for  discoveries  on 
artificial  muscles  (2007). 


IV.  PATENT  FILING  INCLUDING  PROGRAM  RESULTS 

1 .  “Fuel  Powered  Actuators”,  V.  H.  Ebron,  Z.  Yang,  D.  J.  Seyer,  M.  Kozlov,  J. 
Oh,  H.  Xie,  J.  Razal,  J.  P.  Ferraris,  A.  G.  MacDiarmid  and  R.  H.  Baughman, 
Filed  PCT  International  Application  (2007),  PCT  US  2007/063241 . 


V.  ACTUATOR  PUBLICATIONS  SINCE  PROGRAM  START 

1 .  "Electrochemical  Actuation  of  Carbon  Nanotube  Yarns",  T.  Mirfakhrai,  J. 
Oh,  M.  Kozlov,  E.  C.  W.  Fok,  M.  Zhang,  S.  Fang,  R.  H.  Baughman,  and  J. 
D.  Madden,  Invited  Paper  in  Journal  of  Smart  Materials  and  Structures  16, 
S243-S249  (2007). 

2.  “Polymer  Artificial  Muscles”,  T.  Mirfakhrai,  J.  D.  W.  Madden  and 

R.  H.  Baughman,  invited  review  for  Materials  Today  10,  30-38  (2007). 


3.  “Capacitive  Charging  and  Background  Processes  in  Carbon  Nanotube 
Yarn  Actuators”,  T.  Mirfakhrai,  M.  Kozlov,  M.  Zhang,  S.  Fang,  R.  H. 
Baughman,  and  J.  D.  W.  Madden,  Proceedings  of  SPIE,  Vol.  6524, 

65241 H,  1-12  (2007). 

4.  “Carbon  Nanotube  Electroactive  Polymer  Materials:  Opportunities  and 
Challenges”,  L.  Qu,  Q.  Peng,  L.  Dai,  G.  M.  Spinks,  G.  G.  Wallace,  and 

R.  H.  Baughman,  Materials  Research  Society  Bulletin  33,  215-224  (2008) 


VI.  INVITED  TALKS  ON  ACTUATORS  SINCE  PROGRAM  START 

"New  Technologies  from  UTD's  NanoTech  Institute:  From  High  Performance 
Nanotube  Yarns  and  Sheets  to  Fuel-Powered  Artificial  Muscles",  The  6th 
Emerging  Information  Technology  Conference  (Dallas,  Texas,  August  10-11, 
2006). 

“Fuel  Powered  Artificial  Muscles”,  NanoBio  Convergence  Group  Lecture  Series 
(Sept.  13,  2006,  Menlo  Park,  CA). 

“NanoTech  Institute  Inventions:  From  Multifunctional  Carbon  Nanotube  Yarns 
and  Transparent  Metallic  Sheets  to  Fuel-Powered  Muscles  and  Devices  for 
Energy  Harvesting,  Storage,  and  Conversion”,  NanoTX’06  Conference  on  The 
Promise  of  Tomorrow,  The  Business  of  Nanotechnology  (Sept.  27-28,  2006, 
Dallas,  TX) 

“From  Multifunctional  Carbon  Nanotube  Yarns  and  Transparent  Sheets  to  Fuel- 
Powered  Muscles  and  Devices  for  Energy  Harvesting,  Storage,  and  Conversion”. 
Carnegie  Mellon  University  (Nov.  21, 2006,  Pittsburgh,  PA). 

“From  Electrical  to  Fuel  Powered  Artificial  Muscles”,  Fall  Materials  Research 
Society  Meeting  (Nov.  29-Dec.  1,  Boston,  Massachusetts). 

“NanoTech  Institute  Inventions:  From  Multifunctional  Carbon  Nanotube  Yarns 
and  Transparent  Metallic  Sheets  to  Fuel-Powered  Muscles  and  Devices  for 
Energy  Harvesting,  Storage,  and  Conversion”,  International  Conference  on 
Nanoscience  and  Nanotechnology  (Dec.  7-8,  2006,  Guangji,  Korea). 

“From  Electrical  to  Fuel  Powered  Artificial  Muscles”,  KITECH  (Dec.  8,  2006, 
Seoul,  Korea). 


“From  Electrical  to  Fuel  Powered  Artificial  Muscles”,  Hanyang  University  (Dec.  11, 
2006,  Seoul,  Korea). 


“From  Electrical  to  Fuel  Powered  Artificial  Muscles”,  2006  International 
Workshop  on  Innovations  and  Advanced  Studies  -Energy,  Biomedicine, 

Enabling  Materials  and  Micro-Nano  Science  and  Technology,  National  Cheng 
Kung  Univ.  (Dec.  13-15,  2006,  Tainan,  Taiwan). 

“From  Electrical  to  Fuel  Powered  Artificial  Muscles”,  Award  Lecture  for  Kapitza 
Metal  (Jan.  18,  2007,  Moscow,  Russia). 

“From  Multifunctional  Carbon  Nanotube  Tarns  and  Transparent  Sheets  to  Fuel- 
Powered  Muscles  and  Devices  for  Energy  Harvesting,  Storage,  and  Conversion”, 
Honeywell  Corporation  (January  22,  2007,  Morristown,  New  Jersey). 

“Strain  Amplification  for  Artificial  Muscles  and  Sensors  Using  Giant  Poisson 
Ratios  and  Giant  Linear  Compressibilities”,  2nd  International  Symposium  on 
Electromaterials  Science  (Feb.  7-9,  2007,  Wollongong,  Australia). 

“From  Electrical  to  Fuel  Powered  Artificial  Muscles”,  Euroconference  on 
Electronic  Properties  of  Novel  Materials  (March  10-16,  2007,  Kirchberg,  Austria). 

“Autonomous  Carbon  Nanotube  and  Shape  Memory  Yarn,  Sheet,  and  Wire 
Systems”,  SPIE  Smart  Structures  and  Materials  Symposium  (March  18-22,  2007, 
San  Diego,  CA). 

“New  Technologies  from  UTD’s  NanoTech  Institute”,  Banquet  Lecture  for 
Raytheon’s  7th  Electro-Optical  Systems  Technology  Network  Conference  (May 
15-16,  2007,  Richardson,  Texas). 

“From  Multifunctional  Carbon  Nanotube  Tarns  and  Transparent  Sheets  to  Fuel- 
Powered  Muscles  and  Devices  for  Energy  Harvesting,  Storage,  and  Conversion”, 
Dinner  Lecture  for  Institute  for  Innovation  and  Invention  (June  14,  2007, 
Richardson,  Texas). 

“From  Electrical  to  Fuel  Powered  Artificial  Muscles”,  Southern  Methodist 
University  (July  13,  2007,  Dallas,  Texas). 

“NanoTech  Institute  Inventions:  From  Multifunctional  Carbon  Nanotube  Yarns 
and  Transparent  Metallic  Sheets  to  Fuel-Powered  Muscles  and  Devices  for 
Energy  Harvesting,  Storage,  and  Conversion”,  Evening  lecture  for  The  Indus 
Entrepreneurs  (June  14,  2007,  Dallas,  TX). 

“From  Electrical  to  Fuel  Powered  Artificial  Muscles”,  Texas-Korea  Nano 
Workshop  (August  6-8,  2007,  Richardson,  TX). 

“NanoTech  Institute  Inventions:  Transparent  Metallic  Sheets,  Fuel-Powered 
Muscles  and  More”,  MetroCon  (IEEE)  Conference  (Oct.  10,  2007,  Arlington,  TX). 


“From  Multifunctional  Carbon  Nanotube  Yarns  and  Transparent  Sheets  to  Fuel- 
Powered  Muscles  and  Devices  for  Energy  Harvesting,  Storage,  and  Conversion”, 
General  Electric  (Oct.  23,  2007,  Niskayuna,  NY). 

“Nanotechnology  for  Fun  and  Profit”,  Carnegie  Mellon  University  Lecture  Series  - 
Lecture  preceding  receipt  of  2007  Distinguished  Alumni  Award  (Oct.  26,  2007, 
Pittsburgh,  PA). 

“From  Electrical  to  Fuel-Powered  Artificial  Muscles”,  Plenary  Lecture  at  2007 
National  Nano  Engineering  Conference,  prior  to  receiving  Nano  50  Award  for 
Fuel-Powered  Artificial  Muscles  (Nov.  14,  2007,  Boston,  Massachusetts). 

“Nanotechnology  for  Fun  and  Profit”,  Petersen  Institute  of  NanoScience  and 
Engineering,  University  of  Pittsburgh  (Feb  4,  2008,  Pittsburgh,  PA). 

“Diverse  Carbon  Nanotube  Artificial  Muscles  Meet  an  Exciting  New  Family 
Member”,  Monash  University  (Feb.  21-22,  2008,  Melbourne,  Australia). 

“Diverse  Carbon  Nanotube  Artificial  Muscles  Meet  an  Exciting  New  Family 
Member”,  US  AFRL-lsraeli  Bio/Nano  Workshop  for  Materials  (March  24-25,  2008, 
San  Francisco,  CA). 

“Diverse  Carbon  Nanotube  Artificial  Muscles  Meet  an  Exciting  New  Family 
Member”,  ChemOnTubes  08,  (April  6-9,  2008,  Zaragoza,  Spain). 

“Diverse  Carbon  Nanotube  Artificial  Muscles  Meet  an  Exciting  New  Family 
Member”,  Nano  for  Defense  Applications  (April  21-23,  2008,  Arlington,  Virginia). 


Electronic  Textile  Sensors  for  Stress  Measurement  in  Soldiers 


Pis:  Bruce  Gnade,  Roozbeh  Jafari  and  Chris  Gintz 


Abstract 

This  project  focused  on  building  a  platform  with  electronic  textile  and  wearable  sensors  to  detect 
acute  stress  level  in  human.  Our  earlier  investigation  and  literature  search  indicate  that  several 
physiological  parameters  can  be  used  to  identify  stress.  They  include:  1)  nature  of  physical  activities  the 
subject  is  engaged  in,  2)  skin  conductance  and  3)  heart-beat  variability.  To  detect  these  parameters,  we 
proposed  a  wearable  system  with  1)  inertial  sensors  (accelerometers  and  gyroscopes)  and  flex  sensors,  2) 
galvanic  skin  response  sensors  and  3)  pizeo-electric  film  sensors. 

We  built  a  wearable  platform  with  multiple  tiny  sensor  units.  Each  unit  is  composed  of  a 
microcontroller  (Texas  Instrument  MSP430),  a  radio  module,  several  analog  and  digital  interfaces  and  a 
flash  memory.  We  designed  and  implemented  a  software  infrastructure  that  performs  data  collection,  and 
enables  on-board  and  collaborative  signal  processing. 

We  integrated  motion  and  flex  sensors  to  detect  a  set  of  physical  activities  (stand  to  sit,  sit  to  lie, 
kneeling  and  grasping,  reach  up  with  two  hands,  turning  crotch  counter  clockwise,  jumping  on  footrest, 
lifting  object  from  table,  bend  and  grasp,  going  upstairs,  kneeling)  and  investigated  several  challenges 
including:  1)  sensor  optimization  (where  the  sensors  need  to  be  placed),  2)  light-weight  and  real-time 
signal  processing  techniques  that  interpret  sensor  readings  into  indispensable  information  for  users  (i.e. 
what  type  of  activity  was  performed,  for  how  long,. .  .etc). 

We  used  an  off-the-shelf  galvanic  skin  response  sensor  to  detect  the  skin  conductance.  We  designed 
and  built  an  interface  to  connect  the  galvanic  skin  response  sensor  to  our  wearable  platform. 

We  integrated  piezo-electric  film  sensor  into  our  system  for  detecting  the  heartbeat  rate.  We  built  an 
analog  interface  that  consists  of  amplifiers  and  filters.  This  interface  performs  signal  conditioning  and 
generates  a  suitable  output  for  the  analog  interface  of  our  wearable  platform. 

We  collected  data  from  several  participants  and  established  the  effectiveness  of  the  system  and  signal 
processing.  A  maximum  accuracy  of  94%  was  achieved  for  movement  classification  while  the 
effectiveness  of  other  sensors  was  established. 


1.  Introduction 


Our  research  focuses  on  determining  stress  level  using  several  small  wearable  sensors.  First,  we  must 
clearly  define  the  types  of  stress  and  examine  their  causes,  symptoms,  and  effects.  After  that,  we  will 
explain  our  methodology  for  exploiting  the  immediate  biological  effects  of  stress  to  determine  the  the 
wearer’s  level  of  stress.  Then,  we  will  present  the  sensor  systems  and  signal  processing  techniques  we 
developed  to  measure  these  effects  and  report  experimental  validation  of  these  techniques.  Finally  we  will 
detail  our  continued  research  strategy  for  developing  a  complete  stress  evaluation  system. 

1.1  Definition  of  Stress 

Stress  is  the  sum  of  the  physical,  mental,  and  emotional  tensions  on  a  person.  It  is  caused  by  a  number 
of  stressors,  which  are  stimuli  or  events  that  provoke  a  stress  response  in  an  organism.  Stressors  include 
physical  factors  such  as  temperature  or  pain  as  well  as  psychological  factors  including  relationships  with 
friends  and  work.  The  body’s  immediate  response  to  stress  is  to  prepare  for  combat  or  retreat.  Periodic 
exposure  to  stress  is  healthy,  but  frequent  or  chronic  exposure  to  stress  can  be  psychologically  damaging 
and  lead  to  health  problems. 

1.3.  Types  of  Stress 

Acute  stress:  This  is  the  most  common  form  of  stress.  It  comes  from  demands  and  pressures  of  the 
recent  past  and  anticipated  demands  and  pressures  of  the  near  future.  Acute  stress  can  be  thrilling  and 
exciting  in  small  doses,  but  too  much  is  exhausting.  Because  it  is  short-term,  acute  stress  doesn't  have 
enough  time  to  do  the  extensive  damage  associated  with  long-term  stress.  This  type  of  stress  may  appear 
in  anyone's  lifeT  and  is  highly  treatable  and  manageable.  The  effects  are  immediate  and  easily 
differentiable  from  normal  body  responses. 

The  effects  can  include  emotional  stress  (anger,  irritability,  anxiety,  depression),  muscular  problems 
(tension  headaches,  back  pain,  and  muscular  tensions),  digestive  disorders  (heartburn,  diarrhea, 
flatulence,  constipation),  general  arousal  symptoms  (rapid  heartbeat,  sweaty  palms,  heart  palpitations, 
dizziness,  and  cold  hands). 

Acute  stress  is  easily  measured  because  of  the  immediate  and  drastic  biological  effects.  This  is  the 
primary  stress  our  system  is  capable  of  observing. 

Episodic  Acute  Stress:  Frequent  exposure  to  acute  stress  is  Episodic  Acute  Stress.  This  can  result 
from  high  stress  jobs,  poor  organizational  skills,  or  over-commitment.  People  who  perpetually  experience 
this  type  of  stress  can  be  irritable  and  abrupt,  and  have  problems  maintaining  relationships, 

The  symptoms  of  episodic  acute  stress  are  those  of  extended  over-arousal:  persistent  tension 
headaches,  migraines,  hypertension,  chest  pain,  and  heart  disease.  Often,  lifestyle  and  personality  issues 
are  so  ingrained  and  habitual  with  these  individuals  that  they  see  nothing  wrong  with  the  way  they 
conduct  their  lives.  They  often  blame  their  woes  on  other  people  and  external  events.  Frequently,  they  see 
their  lifestyle,  their  patterns  of  interacting  with  others,  and  their  ways  of  perceiving  the  world  as  part  of 
who  and  what  they  are. 

Treating  episodic  acute  stress  requires  intervention  on  a  number  of  levels,  generally  requiring  long¬ 
term  professional  help.  However,  sufferers  can  be  fiercely  resistant  to  change.  Only  the  promise  of  relief 
from  pain  and  discomfort  of  their  symptoms  can  keep  them  in  treatment  and  on  track  in  their  recovery 
program. 

Our  system  would  be  able  to  observe  frequent  exposure  to  acute  stress  and  flag  individuals  who 
experience  this  for  examination,  but  a  psychiatrist  is  required  for  accurate  diagnosis  and  treatment. 

Chronic  Stress:  While  acute  stress  can  be  thrilling  and  exciting,  chronic  stress  is  not.  This  is  the 
grinding  stress  that  wears  people  away  day  after  day,  year  after  year.  Chronic  stress  destroys  bodies, 


minds  and  lives.  It  wreaks  havoc  through  long-term  attrition.  It's  stress  of  poverty,  of  dysfunctional 
families,  unhappy  marriages,  or  despised  jobs.  Chronic  stress  comes  when  a  person  never  sees  a  way  out 
of  a  miserable  situation.  It's  the  stress  of  unrelenting  demands  and  pressures  for  seemingly  interminable 
periods  of  time.  The  worst  aspect  of  chronic  stress  is  that  people  become  accustomed  to  it.  They  forget  it's 
there.  People  are  immediately  aware  of  acute  stress  because  it  is  new;  they  ignore  chronic  stress  because  it 
is  old,  familiar,  and  sometimes,  almost  comfortable. 

Chronic  stress  kills  through  suicide,  violence,  heart  attack,  and  stroke.  Because  physical  and  mental 
resources  are  depleted  through  long-term  attrition,  the  symptoms  of  chronic  stress  are  difficult  to  treat  and 
may  require  extended  medical  as  well  as  behavioral  treatment  and  stress  management. 

Common  causes  of  chronic  stress  include  poverty  and  financial  worries;  long-term  unemployment; 
dysfunctional  family  relationships;  caring  for  a  chronically  ill  family  member;  feeling  trapped  in 
unhealthy  relationships  or  career  choices;  living  in  an  area  besieged  by  war,  ethnic  rivalry,  or  criminal 
violence;  bullying  or  harassment;  and  perfectionism. 

Because  this  stress  is  so  integral  to  an  individual’s  life,  our  system  would  be  completely  unable  to 
differentiate  between  chronic  stress  and  normal  behavior. 

1.3.  Biological  Effects  of  Acute  Stress 

The  focus  of  our  research  is  acute  stress,  as  that  is  most  easily  measured.  Of  particular  interest  is  the 
immediate  biological  response  to  acute  stress.  In  many  cases,  this  response  produces  effects  which  can  be 
measured  with  sensors  placed  on  different  parts  of  the  body.  This  is  particularly  true  with  heart,  lungs, 
and  skin. 

Brain's  Response  to  Acute  Stress:  In  humans,  the  biochemical  response  to  acute  stress  is  known  as 
the  "fight-or-flight"  reaction.  It  begins  with  the  activation  of  a  section  of  the  brain  called  the 
hypothalamic-pituitary-adrenal  system,  or  HPA.  This  system  first  activates  the  release  of  steroid 
hormones,  which  are  also  known  as  glucocorticoids.  These  hormones  include  cortisol,  the  primary  stress 
hormone  in  humans.  The  HPA  system  then  releases  a  set  of  neurotransmitters  known  as  catecholamines, 
which  include  dopamine,  norepinephrine,  and  epinephrine  (also  known  as  adrenaline).  Catecholamines. 
These  hormones  have  three  important  effects: 

•  Activation  of  the  amygdala,  an  almond-shaped  structure  in  the  limbic  system  that  triggers  an 
emotional  response  of  fear. 

•  Signaling  the  hippocampus,  another  part  of  the  limbic  system,  to  store  the  emotional  experience 
in  longterm  memory. 

•  Suppression  of  activity  in  parts  of  the  brain  associated  with  short-term  memory,  concentration, 
and  rational  thinking.  This  suppression  allows  a  human  to  react  quickly  to  a  stressful  situation, 
but  it  also  lowers  ability  to  deal  with  complex  social  or  intellectual  tasks  that  may  be  part  of  the 
situation. 

Response  of  the  Heart,  Lungs,  and  Circulation:  The  stress  response  also  affects  the  heart,  lungs, 
and  circulation.  As  stress  increases,  the  heart  rate  increases  and  becomes  less  erratic,  and  blood  pressure 
also  increases  instantaneously.  Breathing  becomes  rapid  and  the  lungs  take  in  more  oxygen.  Furthermore, 
the  spleen  discharges  red  and  white  blood  cells,  allowing  the  blood  to  transport  more  oxygen  throughout 
the  body.  Blood  flow  may  actually  increase  300%  to  400%,  priming  the  muscles,  lungs,  and  brain  for 
added  demands. 

Response  of  the  Immune  System:  The  effect  on  the  immune  system  from  confrontation  with  stress 
is  similar  to  marshaling  a  defensive  line  of  soldiers  to  potentially  critical  areas.  The  steroid  hormones 
dampen  parts  of  the  immune  system,  so  that  specific  infection  fighters  (including  important  white  blood 
cells)  or  other  immune  molecules  can  be  redistributed.  These  immune-boosting  troops  are  sent  to  the 
body’s  front  lines  where  injury  or  infection  is  most  likely,  such  as  the  skin,  the  bone  marrow,  and  the 
lymph  nodes. 


Response  of  the  Mouth  and  Throat:  In  response  to  stress,  fluids  are  diverted  from  nonessential 
locations,  including  the  mouth.  This  causes  dryness  and  difficulty  in  talking.  Stress  can  also  cause  spasms 
of  the  throat  muscles,  making  it  difficult  to  swallow. 

Response  of  the  Skin:  The  stress  effect  diverts  blood  flow  away  from  the  skin  to  support  the  heart 
and  muscle  tissues.  The  physical  effect  is  a  cool,  clammy,  sweaty  skin.  The  scalp  also  tightens  so  that  the 
hair  seems  to  stand  up.  (temporary  acne  is  also  a  result  of  stress) 

We  have  suspicion  that  Skin  Conductance  and  Heart  Rate  are  promising  signs  for  detecting  stress 
level  using  flexible  electronics.  Although  these  two  signals  have  potentials  for  our  target  application,  they 
would  be  affected  by  other  physical  and  environmental  parameters.  In  particular,  the  level  of  stress  in  a 
human  would  change  under  different  physical  activity  conditions,  temperature  and  humidity.  Therefore, 
we  believe  that  the  system  needs  to  be  embedded  with  other  types  of  sensors  to  capture  signals  correlated 
with  Skin  Conductance  and  Heat  Rate. 

2.  System  Components 

Most  biomedical  applications,  including  stress  level  recognition,  require  a  set  of  sensors  that  are 
spatially  distributed  because  of  (a)  diversity  for  sensitivity,  performance,  accuracy,  and  robustness,  and 
(b)  constraints  posed  by  the  location  of  the  desirable  sensing  points  on  the  surface  of  the  human  body. 
The  sensors  are  responsible  for  collecting  data,  while  the  processing  units  provide  filtering  and  noise 
reduction  on  the  signals.  Due  to  the  nature  and  usage  of  our  sensor  concepts,  the  processing  units  must  be 
fairly  simple,  because  the  devices  and  circuits  will  be  fabricated  using  organic  semiconductors.  Our 
system  is  composed  of  processing  units,  sensors  and  a  gateway  station. 

2.1.  Processing  units 

The  most  important  components  of  our  system  are  distributed  processing  units  that  can  support 
various  types  of  sensors  for  physiological  readings  from  the  human  body.  These  blocks  are  responsible 
for  reading  from  the  sensors,  executing  preprocessing  (filtering,  segmentation,... etc),  feature  extraction, 
and  local  classification.  Furthermore,  they  enable  communications  that  will  lead  to  global  classification 
on  a  centralized  node.  At  the  current,  we  have  been  sending  out  the  raw  data  for  off-line  processing.  The 
processing  units  that  we  utilize  are  Telos-B  motes  developed  at  University  of  California,  Berkeley. 

2.2.  Sensors 

We  have  investigated  and  purchased  several  types  of  off-the-shelf  sensors  that  can  be  potentially  used 
for  stress  evaluation: 

2.2.1.  Fully  Integrated  Sensors 

•  Fingertip  skin  conductance  (flexible  electronic  sensor):  Skin  conductance  is  considered  to  be  a 
function  of  sweat  gland  activity  and  skin  pore  size,  both  of  which  are  controlled  by  the 
sympathetic  nervous  system.  Sweat  glands  are  activated  when  the  sympathetix  nervous  system  is 
aroused  in  response  to  stress  or  anxiety.  As  sweat  is  produced,  the  skin's  capacity  to  conduct 
current  is  enhanced  and  the  measured  conductance  is  increased.  For  this  particular  sensor,  a  small 
voltage  is  applied  to  the  skin  between  two  sensors  and  the  skin's  current  conduction  is  measured. 
We  have  used  this  sensor  extensively  for  our  experiments.  We  measure  the  output  voltage  on  our 
processing  units.  Figure  1  shows  a  typical  skin  conductance  sensor  attached  to  our  processing 
unit. 


Figure  1.  Fingertip  skin  conductance  sensor  and  a  processing  unit 


•  Flex  sensor  (flexible  electronic  sensor):  The  Flex  Sensor  is  a  unique  component  that  changes 
resistance  when  bent.  An  un-flexed  sensor  has  a  nominal  resistance  of  10,000  ohms  (10  K).  As 
the  flex  sensor  is  bent  the  resistance  gradually  increases.  When  the  sensor  is  bent  at  90  degrees  its 
resistance  will  range  between  30-40  K  ohms.  We  use  this  sensor  to  measure  the  level  of  activity 
in  a  human.  A  sample  flex  sensor  is  shown  in  Figure  2. 
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Figure  2.  Flex  Sensor 


•  Inertial  sensors  (rigid  sensors):  Inertial  sensors  are  used  to  continuously  track  the  position, 
orientation,  and  velocity  (direction  and  speed  of  movement)  of  an  object  without  the  need  for 
external  references.  Examples  of  motion  sensor  include  Accelerometers  and  Gyroscopes.  An 
accelerometer  is  a  device  for  measuring  the  acceleration  and  gravity  induced  reaction  forces  it 
experiences.  Single-  and  multi-axis  models  are  available  to  detect  magnitude  and  direction  of  the 
acceleration  as  a  vector  quantity.  Accelerometers  can  be  used  to  sense  inclination,  vibration,  and 
shock.  They  are  increasingly  present  in  portable  electronic  devices.  A  gyroscope  is  a  device  for 
measuring  or  maintaining  orientation,  based  on  the  principles  of  angular  momentum.  We  use 
inertial  sensors  (as  shown  in  Figure  3.)  to  measure  the  level  of  activities. 


Figure  3.  Inertial  sensors  attached  to  a  processing  unit  using  custom-designed  board 


•  Humidity  and  Temperature  sensors:  The  humidity  and  temperature  sensors  are  used  to 
measure  effects  of  environment  on  bio-signals.  In  particular,  humidity  and  temperature  affect  the 
skin  conductance.  We  are  required  to  integrate  these  environmental  sensors  to  the  system  to 
compensate  their  effects.  Fortunately,  some  processing  units  are  equipped  with  temperature  and 
humidity  sensors.  We  are  using  a  sensor  board  shown  in  Figure  4. 


Figure  4.  A  processing  unit  integrated  with  humidity  and  temperature  sensors. 


2.2.1.  Sensors  under  development 

•  Piezoelctric  film  sensor:  Piezoelectric  sensors  measure  the  electrical  potential  caused  by 
applying  mechanical  force  to  a  piezoelectric  material.  Piezoelectric  sensors  are  used  in  a  variety 
of  pressure-sensing  applications. 


Figure  5.  Piezoelectric  sensor 


Figure  5  shows  a  typical  piezoelectric  film  sensor  that  we  tested  in  this  project  for  detecting  heart 
beat. 

Figure  6  shows  our  custom-designed  circuit  board.  The  board  is  designed  to  interface  with  the 
piezoelectric  sensor  and  the  sensor  platform.  It  is  designed  to  reduce  environmental  noise  and  amplify  the 
low-amplitude  input  signal.  It  is  built  with  a  TLV2764  TI  amplifier  which  is  a  single  supply  operational 
amplifier  and  provides  500  kHz  bandwidth  from  only  20  uA,  while  operating  down  to  1.8  V  over  the 


industrial  temperature  range.  Our  amplifier  circuit  is  made  to  capture  signal  readings  from  four 
piezoelectric  sensors.  The  schematic  of  this  4-channel  amplifier  is  shown  is  Figure  7  . 


We  built  the  amplifier  circuit  and  tested  it  while  it  was  powered  at  3.0  V  by  a  typical  power  supply. 
The  piezoelectric  film  was  attached  to  the  wrist  with  athletic  tape,  while  the  circuit  output  was  monitored 
with  an  oscilloscope.  It  was  placed  over  the  pulse  point  to  capture  the  subject’s  heart  rate.  As  of  now,  the 
output  we  are  experiencing  has  a  very  low  SNR,  and  the  noise  has  several  different  sources.  After  several 
rounds  of  modification,  we  managed  to  read  and  amplify  heartbeat,  as  shown  in  Figure  8. 


Figure  7.  Schematic  diagram  of  the  amplifier  circuit 


Figure  8.  Piezo-electric  Prototype  and  Heartbeat  Output 


2.3  Gateway 

A  pocket  PC,  mobile  phone,  or  laptop  can  be  used  to  collect  data  from  processing  units  and  classify 
them.  In  our  experiments,  we  have  been  using  a  laptop  as  a  gateway  only  to  log  the  data  for  off-line  signal 
processing  and  analysis. 


3.  Signal  Processing 

We  propose  the  following  framework,  as  shown  in  Figure  9,  for  stress  assessment  and  classification 
of  activities.  Given  the  goal  of  classifying  behavior  based  on  a  subject’s  sensor  readings,  the  functionality 
of  our  automated  pattern  recognition  system  is  divided  into  three  basic  tasks:  the  pre-processing  and 
filtering;  the  description  task  which  generates  attributes  of  a  movement  using  feature  extraction 
techniques;  and  the  classification  task  which  classify  this  the  movement  based  on  its  attributes. 
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Figure  9.  System  architecture  of  distributed  sensing  platform  for  stress  evaluation 


3.1.  Preprocessing 

Currently,  we  only  obtain  readings  from  the  fingertip  sensors.  The  preprocessing  step  involves 
splitting  the  signal  into  periods  of  activity.  This  can  be  accomplished  by  a  thresholding  scheme  that 
attempts  to  find  any  shaip  changes  in  the  pattern  of  the  signal.  For  the  results  presented  in  this  work,  we 
adopted  a  manual  annotation  scheme  to  mark  the  start  and  end  of  each  activity  period. 

3.2.  Feature  extraction 

We  consider  the  following  features  on  fingertip  skin  conductance  sensor  readings: 

i.  Mean:  mean  of  the  signal  segment. 

ii.  Start-to-end  Amplitude:  the  amplitude  change  across  the  signal  segment. 

iii.  STD:  standard  deviation  of  the  signal  segment. 

iv.  Peak-to-peak  amplitude:  max-min  value  of  the  signal  segment. 

v.  RMS:  root  mean  square  of  the  signal  segment. 

vi.  Morphology:  we  select  k  points  that  are  evenly  distributed  from  the  start  to  end  of  each  activity 
segment.  The  distance  between  two  morphological  features  is  defined  as  the  area  between  two 
morphologies.  This  allows  us  to  compare  the  shape  of  segments,  independent  of  their  durations. 

3.3.  Classification 

We  will  utilize  k  nearest  neighbor  (k-NN)  algorithm  to  classify  movements.  We  adopt  k-NN 
classifiers  due  to  1)  the  simplicity  of  implementation,  2)  small  number  of  training  samples  required  and 
small  memory  usage,  and  3)  its  effectiveness.  The  k-NN  classifier  has  been  fully  implemented  and  tested 
in  MATLAB. 

4.  Skin  Conductance  Experiments 

We  have  integrated  fingertip  skin  conductance  sensors  in  our  system.  Preliminary  experiments  have 
been  carried  out  on  three  subjects,  one  female  and  two  males.  The  system  has  been  setup  to  read  from  the 
sensors  every  10  seconds.  A  raw  sensor  reading  for  a  female  subject  is  shown  in  Figure  10.  The  Y-axis  is 
normalized  while  the  X-axis  corresponds  to  time  (unit:  minute). 

A:  Working  in  the  lab  with  a  PC 

B:  Leaving  the  lab,  walking  through  stairs,  inside  building 
C:  Leaving  the  building,  walking  around  campus 
D:  Sitting  (peak  is  due  to  observing  a  cockroach!) 

E:  Walking  home 

F:  At  home  (watching  TV. .  .etc) 

G:  Going  to  bed 

The  diagram  clearly  illustrates  changes  in  sensor  readings  from  one  action  to  another.  The  signal 
processing  described  in  Section  3  has  not  been  incorporated  yet,  although  the  software  modules  for 
pattern  recognition  has  been  implemented  and  tested  in  MATLAB.  This  is  because  the  pulse  wave  sensor, 
which  measures  the  heartbeat  variability  (F1RV),  is  essential  for  our  system  and  has  not  been  yet 
integrated.  Features  from  F1RV  sensors  are  crucial  classifying  stressful  conditions.  In  the  next  months, 
pulse  wave  sensors  will  be  integrated  into  our  system  and  several  experiments  will  be  carried  out. 

Furthermore,  we  will  develop  sensor  elements  that  can  be  integrated  on  ribbons.  The  initial  phase  is 
to  develop  skin  conductance  and  piezoelectric  sensors  that  will  measure  stress  and  heartbeat  rate.  These 
sensors  will  be  placed  on  fabric  and  inside  a  glove. 


Figure  10.  Skin  conductance  reading  (female  subject) 


5.  Physical  Activity  Level  Experiments 

The  activity  level  can  be  used  to  determine  external  physical  stressors  and  to  determine  the  degree  to 
which  skin  conductance  and  heartbeat  relate  to  physical  activity  versus  stress.  Traditionally,  inertial 
sensors  are  used  to  measure  activity  level.  The  most  popular  inertial  sensors  for  this  purpose  are 
accelerometers  and  gyroscopes,  which,  as  mentioned,  measure  acceleration  and  angular  velocity 
respectively.  Here  we  explore  the  use  of  flex  sensors  for  this  application.  Flex  sensors  provide  less 
information  than  inertial  sensors,  but  offer  many  advantages.  For  both  sensor  types,  sensors  must  be 
distributed  across  the  body.  Inertial  sensors  are  constructed  and  deployed  on  a  rigid  surface,  while 
researchers  are  developing  flex  sensors  that  can  be  woven  into  fabric.  Flex  sensors  are  less  vulnerable  to 
impact  damage,  and  are  small  enough  that  redundant  sensors  may  be  placed  to  decrease  vulnerability  to 
damage  and  compensate  for  misplacement.  These  properties  make  flex  sensors  (and  other  fabric- 
embeddable  sensors)  much  more  desirable  for  deployment  on  athletic  individuals  who  are  involved  in 
rigorous  activities.  Furthermore,  the  power  requirements  for  flex  sensors  are  considerably  less:  they  use 
about  1/100  of  the  power  required  for  gyroscopes,  and  1/10  of  that  required  for  accelerometers.  An 
improved  circuit  design  could  further  widen  the  power  gap.  Finally,  the  cost  per  sensor  is  less  for  flex 
sensors. 

For  these  validation  experiments,  we  use  a  test  system  based  on  commercially  available  flex  sensors. 
These  sensors  are  a  quarter  inch  wide,  and  not  embeddable  in  fabric.  However,  the  results  will  largely 
apply  to  a  system  utilizing  such  sensors.  Figure  8  shows  the  flex  sensor  on  the  outside  of  the  elbow  pad 
for  illustration  purposes,  as  the  sensor  is  actually  under  the  pad.  Figure  1 1  shows  as  subject  wearing 
inertial  and  flex  sensors.  The  sensors  are  placed  on  different  joints  including  right-elbow,  right-knee,  left- 
elbow  and  left-knee. 


Figure  11.  Subject  wearing  Flex  sensor  and  Motion  sensor 

A  sample  waveform  for  movement  ‘Stand  to  Sit’  is  shown  in  Figure  12.  In  this  movement,  the  joints 
of  right  and  left  knees  are  mostly  involved.  The  waveform  shows  that  signals  produced  by  sensors  placed 
on  the  knees  are  more  intensive. 
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Figure  12.  A  sample  waveform  for  ‘Stand  to  Sit’  performed  10  times 


5.1.  Activity  Recognition/Automatic  Segmentation 

Before  movements  can  be  properly  classified,  discrete  movements  must  be  separated  from  other 
movements  and  from  rests.  We  developed  a  method  for  segmenting  movements  based  on  standard 
deviation.  The  sensor  data  from  the  sensors  tends  to  change  rapidly  when  the  subject  is  performing  an 
action  and  slowly  when  the  subject  is  not.  We  previously  validated  this  technique  using  gyroscope  and 
accelerometer  data,  but  here  we  show  the  results  using  flex  sensor  data  acquired  in  the  aforementioned 
experiment. 

The  basic  technique  is  to  determine  the  activity  level  for  a  given  sensor  and  time.  The  activity  levels 
from  each  sensor  on  a  given  sensor  node  are  added  together.  Any  activity  level  above  a  threshold  is 
considered  an  action  and  anything  below  is  a  rest.  Not  every  sensor  can  accurately  determine  what  is 
happening,  therefore  the  sensor  nodes  transmit  the  times  and  durations  of  all  actions  they  have  detected 
individually.  The  base-station  then  determines  a  global  segmentation  by  voting  among  the  sensors:  if  at 
least  3/8  of  the  sensors  think  an  action  occurs  at  a  given  time,  it  is  considered  an  action.  Any  actions  or 
rests  that  are  too  short  are  considered  transients  and  are  removed. 

The  activity  level  is  computed  from  the  standard  deviation  of  values  from  a  single  sensor  in  a  window 
around  the  current  point.  The  amplitude  of  this  signal  is  adjusted  using  dynamic  gain  control  to  account 
for  residual  motion  in  rest  states. 
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This  technique  has  several  limitations:  it  cannot  segment  continuous  movements  such  as  walking  or 
running;  it  often  divides  a  single  event  into  several  segments,  and  it  falsely  considers  several  close  events 
to  be  a  single  event.  For  walking,  we  have  developed  an  effective  segmentation  technique  in  which  is  not 
discussed  in  this  report.  The  other  two  problems  can  be  compensated  for  by  changing  what  we  consider  to 
be  the  event:  perhaps  a  movement  does  have  four  parts.  This  is  not  always  an  effective  solution. 
Therefore,  we  are  developing  a  new  algorithm  to  address  these  issues. 
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Figure  15.  Results  for  lifting  an  object 


Figure  14  shows  segmentation  results  for  kneeling.  The  green  signal  is  the  signal  from  the  sensor,  and 
the  red  illustrates  the  activity  level.  The  blue  rectangles  are  the  per-node  segmentation,  and  the  red 
rectangles  on  top  show  the  global  segmentation.  In  Figure  14,  the  segmentation  is  perfect:  it  correctly 
considers  kneeling  to  be  one  action  and  standing  up  another.  A  good  example  of  an  incorrect  separation  is 
shown  in  Figure  15.  Even  if  we  split  the  movement  “lifting  an  object  from  table”  into  four  sub-actions, 
there  are  still  several  cases  where  one  of  those  sub-actions  is  missing.  The  signals  from  the  leg  sensors 
weren’t  shown  as  they  were  consistently  below  the  threshold  for  action  identification. 

Overall,  the  segmentation  was  less  effective  than  in  previous  experiments  involving  inertial  sensors. 
Moreover,  many  of  these  actions  featured  sequential  rather  than  simultaneous  limb  movements  that  led  to 
false  splits  and  joins.  For  instance,  in  the  movement  “bend  and  grasp”  the  legs  bend  first  then  stay  still 
while  the  arms  bend.  This  appears  to  be  two  movements,  although  it  is  incorporated  as  one.  Table  1  was 
compiled  by  manually  judging  how  many  false  positives,  false  negatives,  joins  (two  separate  actions 
considered  one  action),  and  splits  (a  single  action  split  into  two  or  more  separate  actions).  Even  if  one 
action  was  split  into  4  pieces,  it  would  be  counted  as  one  split.  For  movements  six  to  fifteen,  a  sensor 
failed  on  one  of  the  subjects,  resulting  in  fewer  total  movements  listed. 


Table  1.  Segmentation  results 


No 

Total 

FP 

FN 

Joins 

Splits 

1 

55 

5 

0 

0 

3 

2 

58 

6 

0 

0 

24 

3 

60 

T 

2 

8 

11 

6 

40 

4 

6 

1 

11 

9 

40 

0 

2 

0 

0 

10 

44 

0 

7 

0 

1 

11 

19 

0 

0 

0 

19 

12 

42 

5 

0 

1 

0 

13 

38 

7 

0 

0 

0 

14 

40 

1 

1 

0 

0 

15 

32 

0 

0 

0 

0 

5.2.  Feature  Selection 

Electronic  textiles  are  highly  constrained  in  terms  of  computation  and  memory  usage.  That  is,  the 
signal  processing  on  the  fabrics  is  limited  to  low  processing  and  storage.  Since  the  number  of  attributes 
presented  in  the  ‘feature  extraction’  stage  would  directly  affect  system  requirement  in  terms  of 
computation  and  memory,  selecting  a  small  feature  space  that  maintains  acceptable  classification 
accuracy  while  conforming  system  constraints  would  be  very  important.  Therefore,  we  have  tried  to  select 
a  very  small  feature  space  that  achieves  good  classification  accuracy.  We  use  a  heuristic  algorithm,  so- 
called  Relief,  that  works  based  on  a  feature  weighting  scheme  and  finally  provides  direct  insights  into 
choosing  highly-relevant  features.  A  brief  description  of  the  Relief  algorithm  follows. 

Relief  uses  a  statistical  method  to  select  relevant  features.  It  is  a  feature  weight-based  algorithm 
inspired  by  instance-based  learning  algorithms.  From  the  set  of  training  instances,  it  first  chooses  a 
sample  of  instances.  Relief  randomly  picks  this  sample  of  instances,  and  for  each  instance  in  it  finds  Near 
Hit  and  Near  Miss  instances  based  on  a  Euclidean  distance  measure.  Near  Hit  is  the  instance  having 
minimum  Euclidean  distance  among  all  instances  of  the  same  class  as  that  of  the  chosen  instance;  Near- 
Miss  is  the  instance  having  minimum  Euclidean  distance  among  all  instances  of  different  class.  It  updates 


the  weights  of  the  features  that  are  initialized  to  zero  in  the  beginning  based  on  an  intuitive  idea  that  a 
feature  is  more  relevant  if  it  distinguishes  between  an  instance  and  its  Near  Miss,  and  less  relevant  if  it 
distinguishes  between  an  instance  and  its  Near  Hit.  After  exhausting  all  instances  in  the  sample,  it 
chooses  all  features  having  weight  greater  than  or  equal  to  a  threshold.  This  threshold  can  be 
automatically  calculated  using  a  function  that  uses  the  number  of  instances  in  the  sample;  it  can  also  be 
determined  by  inspection  (all  features  with  positive  weights  are  selected).  Relief  works  for  noisy  and 
correlated  features,  and  runs  in  linear  time  relative  to  the  number  of  given  features  and  number  of  samples 
considered  for  feature  selection.  It  works  both  for  nominal  and  continuous  data.  One  major  limitation  is 
that  it  does  not  help  with  redundant  features  and,  hence,  generates  a  non-optimal  feature  set  size  in  the 
presence  of  redundant  features.  This  can  be  overcome  by  a  subsequent  exhaustive  search  over  the  subsets 
of  all  features  selected  by  Relief.  Relief  works  only  for  binary  classes.  This  limitation  is  overcome  by 
Relief-F  that  also  tackles  the  problem  of  incomplete  data. 

For  feature  conditioning,  we  first  extracted  the  following  features  on  flex  sensor  readings. 

1 .  Mean:  mean  of  the  signal  segment. 

2.  Max:  maximum  amplitude  of  the  signal  segment. 

3.  Min:  minimum  amplitude  of  the  signal  segment. 

4.  Peak-to-peak  amplitude:  max-min  value  of  the  signal  segment. 

5.  VAR:  statistical  variance  of  the  signal. 

6.  STD:  standard  deviation  of  the  signal  segment. 

7.  RMS:  root  mean  square  of  the  signal  segment. 

8.  Start-to-end  Amplitude:  the  amplitude  change  across  the  signal  segment. 

9.  Median:  the  middle  sample  data  of  the  signal  segment. 

10.  Slope:  the  slope  of  the  signal  segment. 

11-20.  Morphology:  we  selected  10  points  that  are  evenly  distributed  from  the  starting  to  the  end  of 
each  activity  segment.  The  distance  between  two  morphological  features  is  defined  as  the  area  between 
two  morphologies.  This  assists  us  to  compare  the  shape  of  segments  independent  from  their  time 
durations. 

We  used  the  data  collected  in  the  experiments  mentioned  above  (three  subjects  and  four  sensor 
nodes),  and  we  fed  only  the  training  set  (50%  of  the  collected  data)  into  the  feature  conditioning  system. 
The  input  of  the  system  was  a  training  set  containing  80  features  (20  for  each  sensor  nodes).  We 
performed  a  feature  selection  using  Relief  algorithm  and  selected  only  the  first  four  significant  features. 
The  resulting  features  are  listed  in  Table  2. 


Table  2.  Significant  Features 


No 

Feature  Name 

Sensor  Number 

Weight 

1 

VAR 

2 

100 

2 

VAR 

3 

98 

3 

VAR 

1 

87 

4 

VAR 

4 

28 

To  test  the  effectiveness  of  our  feature  selection  algorithm,  we  fed  the  rest  of  the  collected  data  into 
our  test  set  to  a  central  k-NN  classifier  in  which  we  achieved  82%  accuracy.  We  then  compared  this 
number  with  the  classification  results  reported  for  the  flex  sensor  in  Table  4  (85%)  which  was  attained 
using  80  dimensional  feature  space. 


5.3.  Activity  Level  Analysis 

For  the  experimental  setup,  we  asked  three  subjects  aged  between  20  and  35  to  perform  eleven 
movements  listed  in  Table  3;  ten  trials  each. 


Table  3.  Movements  List  for  Experiments  with  Flex  Sensors 


No. 

Description 

1 

Stand  to  sit 

2 

Sit  to  lie 

3 

Kneeling  and  grasping  (left  leg) 

4 

Reach  up  with  two  hands 

5 

Turning  crotch  counter  clockwise 

6 

Jumping  on  footrest 

7 

Lifting  object  from  table 

8 

Bend  and  grasp 

9 

Going  upstairs  ( 1  step,  right  leg  first) 

10 

Kneeling,  left  leg  first 

11 

Kneeling,  right  leg  first 

The  raw  data  is  collected  from  each  of  the  five  sensors  on  each  of  the  eight  sensor  nodes  at  40Flz.  The 
data  is  filtered  with  a  five-point  moving  average.  We  determine  the  portion  of  the  signal  that  represents  a 
complete  action.  For  experimental  purposes,  this  is  done  by  manually  segmenting  the  actions.  Since  we 
are  developing  different  parts  of  the  system  concurrently,  we  have  not  integrated  our  automatic 
segmentation  module  into  the  classification  tool.  Furthermore,  at  this  step  of  our  project,  we  avoid 
integrating  errors  of  automatic  segmentation  to  the  classification  results.  For  the  feature  selection,  single 
value  features  are  extracted.  We  transform  sensor  readings  into  a  set  of  informative  attributes  as  described 
before.  After  preprocessing  is  completed,  each  sensor  node  transmits  its  local  features  to  the  gateway. 
This  node  is  responsible  for  making  the  final  decision  on  the  type  of  action  done.  We  use  the  k-Nearest 
Neighbor  (k-NN)  classifier  due  to  its  simplicity  and  scalability.  The  final  decision  can  be  made  using 
either  a  data  fusion  or  a  decision  fusion  scheme.  We  utilize  the  former  method  by  feeding  features  from 
all  sensor  nodes  into  the  central  classifier. 

To  verify  the  effectiveness  of  flex  sensors  in  human  motion  detection,  we  compare  the  results  of 
motion  classification  using  flex  sensors  with  the  results  using  motion  sensors.  We  used  50%  of  collected 
data  for  training  the  classifier  and  the  rest  for  testing  the  functionality  of  the  system.  Figure  16  shows  the 
per-action  accuracy  of  the  classification  using  a  k-NN  classifier  where  k=l. 

While  motion  sensors  (including  one  3-axis  accelerometer  and  one  2-axis  gyroscope  per 
sensor  node)  achieve  a  better  classification  accuracy  (overall  accuracy  91%),  flex  sensors  have  a 
reasonable  ability  to  detect  certain  movements.  While  the  flex  sensors  have  less  accuracy  in 
identifying  some  movements  (e.g.  1,  5  and  11),  their  classification  accuracy  is  equivalent  to 
motion  sensors’  for  several  movements  (e.g.  4,  8  and  9)  and  even  higher  for  some  other 
movements  (e.g.  7  and  10). 


Classification  Accuracy 


In  Table  4,  the  overall  accuracy  (using  11 -movement  data  set)  is  shown.  Considering  a  1-NN 
classifier,  when  using  only  flex  sensors,  the  amount  of  reduction  in  accuracy  is  %6  compared  to  the  case 
of  motion  sensors. 


Table  4.  Overall  classification  accuracy  (11  actions) 


K 

Flex  sensor 

Motion  sensors 

Both 

1 

%85 

%91 

%92 

2 

%78 

%92 

%93 

3 

%80 

%93 

%94 

4 

%77 

%90 

%90 

5 

%76 

%88 

%89 

5.4.  Sensitivity  to  Misplaced  Sensors 

Soldiers  can  experience  a  very  high  level  of  activity  everyday.  The  sensors  may  move  during  the  day. 
Therefore,  calibrating  sensor  nodes  to  compensate  for  misplaced  electronics  would  play  an  important  role 
in  designing  this  system  for  use  by  soldiers.  To  test  the  effectiveness  of  flex  sensors  in  such 
environments,  one  of  our  subjects  wore  the  sensor  platform  as  before,  but  with  the  flex  sensors  misplaced. 
We  consider  two  scenarios:  in  the  first,  the  sensors  placed  on  the  right  hand  side  of  the  body  are  moved 


up,  making  them  unable  to  capture  the  whole  action.  In  the  second  scenario,  we  moved  the  flex  sensors 
down  toward  the  ground.  The  subject  did  all  1 1  movements  again.  The  collected  data  in  this  way  was  fed 
into  a  classifier  to  determine  if  the  system  could  detect  the  movements  with  misplaced  sensors.  Figure  17 
shows  the  classification  results.  The  overall  accuracy  for  misplaced  scenario  was  74%. 


Classification  Accuracy 


□  Accuracy 


Figure  17.  Classification  accuracy  for  misplaced  situation  (flex  sensor) 


6.  Conclusion 

We  have  invested  considerable  effort  into  developing  the  individual  pieces  necessary  for  a  wearable 
stress  evaluation  system.  Our  experimental  results  show  that  the  system  is  effective  at  measuring  the 
individual  parameters  necessary  to  determine  stress  level.  The  piezo-electric  heart  rate  sensor  was  the 
most  difficult  component  of  the  system,  however,  its  functionality  was  successfully  verified. 
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Summary 

Over  the  duration  of  the  SPRING  IV  funding  period,  we  have  completed  two  projects, 
leading  to  three  journal  publications  (one  submitted  and  two  under  preparation).  First,  we  have 
developed  a  fast  and  reproducible  method  for  fabrication  of  nanoscale  light  sources  with 
relatively  inexpensive  organic  light  emitting  materials  which  alleviates  charge  spreading  -  a 
phenomenon  that  has  been  detennined  to  be  a  limitation  in  scaling  organic  light  emitting  diodes 
(OLEDs)  down  to  nanoscale  (manuscript  submitted  to  Nanotechnology).  Second,  we  have  fully 
characterized  nanoimprint  induced,  geometry  dependent,  3-D  chain  alignment  in  imprinted 
P3HT  nanostructures.  We  show  preferential  alignment  of  P3HT  in  imprinted  grating 
nanostructures  which  would  allow  efficient  charge  transport  in  devices  with  electric  fields  both 
parallel  (organic  field  effect  transistors)  and  perpendicular  (solar  cells)  to  the  substrate 
(manuscript  in  preparation).  Finally,  we  have  developed  a  novel  process  to  fabricate 
nanogratings  over  large  areas  with  dimensions  as  small  as  10  mn  or  less  (manuscript  in 
preparation). 

I.  High  density  nano-channel  organic  light  emitting  diodes 
Krutarth  Trivedi,  Unnat  Bhansali,  Bruce  Gnade  and  Walter  Hu 

Indium  tin  oxide  (ITO)  coated  glass  (Delta  Technologies)  served  as  OLED  device 
substrates.  Identical  devices  were  fabricated  on  silicon  for  cross  sectional  SEM  imaging  of  the 
device  stack.  The  substrates  were  cleaned  in  acetone  and  isopropyl  alcohol,  treated  with  oxygen 
plasma  and  patterned  by  photolithography  to  define  separate  device  areas.  Nano-channel  OLED 
devices  with  line  and  space  gratings  (100  mn  each)  were  defined  in  SU-8  photo  resist  on  top  of 
the  patterned  ITO  by  NIL  as  shown  in  Fig.  1.  SU-8  is  expected  to  be  stable  during  device 
operation  as  it  has  a  post-crosslink  glass  transition  temperature  of  over  200  °C.  In  addition  to 
having  superior  post-crosslink  insulating  and  mechanical  properties,  it  offers  easy,  low 
temperature  fabrication  (55-85  °C)  by  NIL.  The  SU-8  was  imprinted  with  a  silicon  mold 
(Nanonex),  containing  a  large  area  of  100  mn  line  and  space  gratings  with  an  average  depth  of  95 
mn  (Fig.  2a).  The  mold  was  treated  with  perfluorodecyltrichlorosilane  (FDTS)  to  prevent 
adhesion  of  SU-8  to  the  mold  during  imprint.  The  imprinted  SU-8  patterns  were  cured  under  UV 
(450  mJ/cm2)  and  baked  to  facilitate  cross-linking  of  SU-8.  We  found  an  initial  spin-coated  SU- 
8  film  thickness  of  95  mn  results  in  uniform  75-85  mn  tall  features  with  a  residue  layer  of  30  mn 
over  an  area  of  4  cm  (Fig.  2b).  Thickness  of  the  residue  layer  increases  with  initial  film 
thickness.  Therefore,  it  is  possible  to  control  final  SU-8  structure  height  after  residue  etching 
without  changing  the  mold  height.  To  expose  ITO  between  SU-8  gratings,  the  residue  layer  must 
be  etched  while  maintaining  overall  grating  profile  integrity.  However,  etching  of  the  residue 
layer  without  protecting  the  imprinted  features  would  result  in  a  decrease  in  pattern  height  as 
well  as  rounding  of  the  top  surface  of  the  imprinted  features  which  could  lead  to  confonnal 
coating  of  the  active  materials.  To  selectively  etch  the  residue  layer,  a  hard  masked  residue 
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etching  process  was  developed  using  oblique  angle  evaporation  of  chrome  (Fig.  lc)  at  an  angle 
of  10°,  which  allows  selective  coating  of  metal  only  on  the  top  surfaces  of  SU-8  gratings.  Figure 
2b  shows  a  SEM  micrograph  of  imprinted  patterns  after  oblique  angle  Cr  deposition.  The  residue 
layer  was  etched  in  an  inductively  coupled  oxygen  plasma  (5  mTorr,  25  °C,  300  watt  ICP  power, 
100  watt  bias  power).  The  samples  were  etched  for  10  seconds  to  account  for  any  non¬ 
uniformity  in  the  residue  layer,  assuring  complete  residue  removal.  The  bulk  of  the  residue  is 
likely  etched  away  in  approximately  three  seconds  while  the  ITO  is  further  cleaned  by  the 
oxygen  plasma  for  the  remainder  of  the  etch.  Figure  2c  shows  a  SEM  micrograph  of  the 
imprinted  SU-8  patterns  after  residue  etch,  with  Cr  remaining  on  top  of  the  patterns.  After 
residue  etching,  the  samples  undergo  a  two  minute  wet  etch  with  CR-9S  Cr  wet  etchant  to 
remove  the  Cr  mask,  leaving  only  isolated  SU-8  nano-channel  gratings  with  ITO  exposed 
between  gratings.  CR-9S  Cr  etchant  is  routinely  used  in  photomask  fabrication  and  is  not 
expected  to  damage  oxides  such  as  ITO.  This  process  produces  rectangular  gratings  with  sharp 
and  vertical  sidewalls,  which  is  expected  to  prevent  highly  conformal  coating  during  thermal 
evaporation  of  organics.  In  addition  to  nano-channel  SU-8  gratings,  we  fabricated  micrometer 
scale  SU-8  line  and  space  patterns  with  dimensions  of  two,  five  and  ten  micrometers  by  a 
standard  photolithography  process.  Initial  film  thickness  was  calibrated  to  yield  SU-8  gratings 
with  the  same  height  as  nano-channel  gratings. 

Next,  N,N’-Bis(naphthalen-l-yl)-N,N’-bis(phenyl)benzidine  (NPB),  tris-(8- 
hydroxyquinoline)  aluminum  (Alq3),  calcium  (Ca),  and  silver  (Ag)  were  deposited  into  the  nano¬ 
channel  and  microscale  SU-8  gratings  by  thermal  evaporation,  which  is  expected  to  produce 
uniform  but  more  non-conformal  filling  in  the  SU-8  nanotrenches  without  significant  sidewall 
coating.  For  all  devices,  30  mn  of  NPB  was  deposited,  followed  by  75  mn  of  Alq3,  50  mn  of  Ca 
and  100  mn  of  Ag.  The  thickness  of  the  individual  organic  materials  inside  the  nano-channel 
gratings  was  carefully  calibrated,  as  it  can  be  vastly  different  from  blanket  deposition,  which  is 
used  in  conventional  OLED  device  fabrication.  Significant  shadowing,  caused  by  angled 
incident  flux  of  organics  inside  the  high  aspect  ratio  trenches  of  nano-channel  devices  can  occur 
during  thermal  evaporation  as  material  is  deposited  in  between  and  on  top  of  the  SU-8  gratings, 
resulting  in  non-uniform  coating  inside  trenches.  Consequently,  the  effect  of  shadowing 
increases  as  more  material  is  deposited  on  top  of  the  SU-8  gratings,  as  there  is  an  effective 
increase  in  the  aspect  ratio  of  the  trenches.  Problematic  shadowing  can  be  alleviated  by  holding 
the  substrates  stationary  over  the  organic  sources  during  thennal  evaporation.  Devices  fabricated 
on  silicon,  with  identical  nano-channel  patterns,  are  placed  alongside  devices  made  on  ITO 
coated  glass  in  order  to  examine  the  filling  inside  the  gratings  with  cross  sectional  SEM.  Figure 
2d  shows  a  SEM  micrograph  of  a  typical  nano-channel  OLED  device.  The  organics  are 
deposited  in  the  trenches  as  well  as  on  top  of  the  SU-8  gratings.  However,  in  contrast  to  spin- 
coated  active  materials,  the  organic  materials  are  deposited  in  a  highly  non-conformal  manner, 
with  a  negligible  amount  of  material  on  the  sidewalls.  The  grating  dimensions  and  filling 
properties  are  expected  to  be  similar  for  devices  fabricated  on  ITO  coated  glass. 

All  devices  underwent  electrical  and  optical  characterization  with  a  Keithley  2400  power 
supply  and  a  photoresearch  PR-650  Spectra  Photometer  interfaced  with  Labview.  The  OLED 
devices  were  biased  only  up  to  10  volts  to  increase  the  lifetime  of  the  device  for  repeated  testing. 
Figure  3  shows  the  electrical  and  optical  characteristics  of  nano-channel  and  microscale  OLED 
devices  fabricated  on  ITO  coated  glass.  As  seen  in  Fig.  3a,  the  current  density  of  nano-channel 
OLED  devices  is  observed  to  be  higher  than  that  of  microscale  devices.  Current  in  OLED 
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devices  is  dependent  on  various  parameters  such  as  materials  used  and  active  layer  thickness  but 
less  current  in  nano-channel  OLED  devices  could  be  attributed  to  reduction  of  charge  spreading 
by  successful  isolation  of  light  emitting  trenches  from  non-emitting  SU-8  gratings,  which  is 
desired  for  discrete  nanoscale  light  sources.  Fig.  3b  shows  luminescence  of  microscale  grating 
devices  to  be  higher  than  the  nano-channel  OLED  devices.  The  data  shows  that  light  output 
from  the  2  pm  grating  device  is  not  only  brighter  than  the  5  and  10  pm  grating  devices,  but  the 
nano-channel  devices  as  well.  Hence,  there  is  no  clear  geometry  dependence  in  the  brightness  of 
OLED  devices  of  varying  grating  size.  At  low  bias,  the  nano-channel  devices  exhibited  slightly 
lower  brightness  than  the  microscale  grating  devices  while  having  a  higher  current  density. 
However,  nano-channel  devices  were  brighter  than  microscale  grating  devices  at  higher  current 
densities,  beyond  applied  bias  of  10  volts,  reaching  luminescence  values  exceeding  1000  Cd/nU 
at  15  volts.  The  observed  higher  current  density  paired  with  relatively  lower  luminescence  at 
low  bias  are  not  related  to  scaling  down  of  the  dimension  of  light  emitting  trenches.  Slight 
variation  between  devices  is  likely  due  to  processing  conditions  from  sample  to  sample.  The 
process  flow  and  conditions  can  certainly  be  optimized  for  better  device  performance.  The 
electrical  and  optical  characterization  data  of  the  nano-channel  OLEDs  show  no  geometry 
dependence  and,  therefore,  indicate  that  miniaturization  of  OLEDs  to  nanoscale  would  result  in 
similar  performance  as  microscale  devices,  which  makes  them  feasible  as  nanoscale  light  sources 
for  near-field  applications. 

SPM  was  performed  under  ambient  conditions  with  white  light  on  both  microscale  and 
nano-channel  OLED  devices  without  metal  cathodes.  Metal  was  not  deposited  so  that  the 
surface  of  the  organics  can  be  scanned  across  the  morphology  of  the  devices.  A  Veeco 
Dimension  V  atomic  force  microscope  system  was  set  up  for  amplitude  modulation  (AM  mode) 
SPM  with  a  conductive  ID  silicon  tip  having  a  spring  constant  of  40  N/m.  An  a.c  voltage  of  4 
volts  was  superimposed  on  the  tip,  while  a  d.c  voltage  of  8  volts  was  applied  to  the  sample, 
which  caused  the  tip  to  oscillate  in  response  to  the  electrostatic  forces  on  the  surface  of  the 
sample.  During  the  surface  potential  scan,  the  tip  is  held  5  mn  above  the  surface  of  the  sample  as 
the  surface  potential  is  recorded.  Figure  4  shows  SPM  micrographs  of  the  surface  potential 
across  several  gratings  of  the  microscale  and  nano-channel  OLED  devices  as  well  as  light 
emission  profiles  taken  under  an  optical  microscope  (insets).  The  change  in  surface  potential 
corresponds  to  the  grating  topography.  A  distinct  difference  in  the  surface  potential  on  top  of  the 
SU-8  gratings  and  the  trenches  is  immediately  visible.  In  all  devices,  a  higher  surface  potential 
is  observed  on  top  of  the  SU-8  structures,  indicating  a  non-uniform  charge  distribution  between 
the  top  of  the  SU-8  gratings  and  the  trenches.  Table  I  shows  the  same  recorded  average 
difference  in  surface  potential  (ASP)  of  organics  on  top  of  SU-8  gratings  and  inside  trenches  for 
all  devices,  regardless  of  geometry.  The  slight  reduction  in  contrast  and  image  quality  for  nano¬ 
channel  gratings  (Fig.  4d)  is  expected  in  AM  mode  as  it  is  carried  out  in  ambient  conditions. 
More  importantly,  long  range  electrostatic  forces  affect  the  readings  in  AM  mode,  making  it 
difficult  to  differentiate  changes  in  surface  potential  that  are  too  closely-spaced.  The  SPM 
measurements  presented  show  a  periodic  difference  in  surface  potential  of  organics  inside 
trenches  and  on  top  of  SU-8  gratings.  This  distinct  periodic  difference  in  surface  potential  can 
be  attributed  to  electrical  separation  of  the  organic  stacks  inside  trenches  and  on  top  of  the  SU-8 
gratings.  Furthennore,  electrical  separation  of  organic  stacks  is  evident  in  the  light  emission 
profiles  of  microscale  grating  devices  (insets  of  Fig.  4,b,c,d).  Light  is  only  output  from  the 
trenches,  indicating  electron-hole  recombination  is  clearly  confined  to  the  organics  inside 
trenches.  This  absolute  confinement  of  the  recombination  region  can  be  attributed  to  the 
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confinement  of  holes  to  the  NPB  inside  trenches  despite  any  thin  layer  of  NPB  on  the  sidewalls. 

It  is  apparent  from  the  light  emission  profiles  of  microscale  grating  devices  that  charge  spreading 
is  not  a  problem  in  devices  fabricated  only  with  thermally  evaporated  NPB.  Unfortunately,  due 
to  diffraction  limitations  of  optical  microscopy  similar  discrete  light  emission  profile  from  nano¬ 
channel  OLED  devices  cannot  be  observed  directly.  Nevertheless,  the  similar  ASP  observed  in 
the  microscale  and  nano-channel  OLED  devices  indicates  that  light  emission  profile  of  nano¬ 
channel  OLED  devices  should  also  be  discrete  with  light  output  only  from  the  nanoscale  trenches 
in  between  the  SU-8  gratings,  much  like  light  emission  from  microscale  devices. 

II.  Nanoimprint  induced,  geometry  dependent  3-D  alignment  in  imprinted  P 3 HT  nanostructures 

Mukti  Aryal,  Krutarth  Trivedi  and  Walter  Hu 

Uniform  80  mn  thin  films  of  P3HT  were  prepared  on  a  silicon  substrate  and  soft-baked  at 
150  °C  for  5  minutes  on  a  hot  plate.  The  films  were  imprinted  with  nanograting  and  nanoporous 
molds  coated  with  lH,lH,2H,2H-perfluorodecyltrichlorosilane  (FDTS)  as  per  the  process  flow 
shown  in  Fig.  5.  The  nanograting  mold  consists  of  200  mn  deep,  65  mn  wide  gratings  at  a  period 
of  200  mn  (Fig.  6a)  while  the  nanoporous  mold  consists  of  a  hexagonal  array  of  350  mn  deep 
nanopores  with  80  mn  diameter  (Fig.  6b).  Nanoimprinting  was  done  at  170  C  for  10  minutes, 
after  which  the  samples  were  de-moled  at  25  C.  P3HT  nanogratings  and  nanopillar  structures 
were  obtained  after  releasing  the  mold  from  the  sample  (Fig.  7).  With  careful  control  of  spin¬ 
coating  and  imprint  conditions  it  is  possible  to  control  the  thickness  of  the  residue  layer  of 
imprints.  As  seen  in  the  figure,  the  residue  layer  is  controlled  to  be  20  mn. 

P3HT  crystallization  and  chain  orientation  were  studied  by  in-plane  and  out-of-plane 
grazing  incident  XRD  measurements.  With  this  measurement  only  the  lattice  distances  in  the  z- 
direction  are  detected  with  the  (001)  peak  generally  not  detectable  in  any  case.  The  schematic  of 
face-on  and  chain-on  orientations  is  given  in  the  inset  in  Fig.  8a,  ii  and  iii,  respectively.  The 
graph  in  Fig  8b.  shows  out-of-plane  measurement  of  imprinted  and  non-imprinted  thin  film 
samples.  Only  the  (100)  peak,  corresponding  to  lattice  parameter  a=1.69  mn,  is  observed, 
indicating  edge-on  orientation  (Fig.  8a,  iii)  is  dominant  in  all  samples.  .  However,  the  lower 
intensity  of  the  (100)  peak  for  the  nanoimprinted  samples  (gratings  and  pillars)  indicates  re¬ 
ordering  of  P3HT  chains  during  nanoimprint.  In-plane  measurement  is  conducted  to  investigate 
the  precise  ordering  of  chains  with  respect  to  grating  direction.  As  seen  in  figure  9,  the  (010) 
peak  at  23.4°,  corresponding  to  the  lattice  parameter  b=0.38nm,  was  detected  for  the  non- 
imprinted  thin  film  sample,  consistent  with  the  (100)  peak  observed  in  the  out-of-plane 
measurement  in  Fig.  8  (blue  curve).  This  is  because  edge  on  orientation  observed  in  out  of 
plane  measurement  means  to  7i-staking  parallel  to  the  substrate  or  x-y  plane  which  could  be 
observed  in  in-plane  measurement.  Therefore  both  measurements  show  the  dominance  of  edge 
on  orientation  in  non  imprinted  P3HT  film.  The  reordering  of  polymer  chain  nanopillar 
structures  is  proved  since  (100)  peak  in  the  graph  of  this  sample  (blue)  dominates  the  same  for 
the  non  imprinted  sample  (black).  The  increase  in  (100)  peak  intensity  nanopillar  structures 
compared  to  the  thin  film  in  in-plane  measurement  means  that  the  lattice  parameter  a  which  was 
observed  in  z-direction  in  thin  film  reordered  to  become  parallel  to  the  substrate.  The  lattice 
parameter  b  is  parallel  to  the  substrate  since  (010)  was  detected  only  in  the  in-plane.  With  this,  c- 
axis  must  be  normal  to  the  substrate.  Therefore  the  polymer  chain  re-ordering  of  c-axis  normal  or 
the  chain  alignment  with  vertical  polymer  backbone  was  conformed  in  nanopillar  structure. 
However,  the  chain  ordering  in  nanograting  structure  is  entirely  different. 


5 


The  intensity  of  the  peak  (100)  varies  with  the  sample  rotation  in  nanograting  sample. 
When  the  sample  was  scanned  placing  it  in  the  thin  film  stage  making  grating  direction  parallel 
to  the  source  the  peak  (100)  intensity  becomes  huge  and  the  intensity  drops  to  noise  level  while 
the  sample  was  placed  making  nanograting  direction  perpendicular  to  the  source.  Just  opposite  of 
this,  the  intensity  of  (010)  peak  was  detected  while  scanning  in  perpendicular  direction  and  that 
was  dropped  to  noise  level  when  scanning  in  parallel  direction.  This  means  the  reordering  of 
polymer  chain  depends  on  the  geometry  of  the  nanostructures.  The  chain  alignment  in  nanopillar 
structure  is  isotropic  in  x-y  plane  since  the  sample  rotation  does  not  make  any  difference  in  the 
(100)  peak  intensity  while  the  difference  in  (100)  peak  intensity  in  nanograting  corresponding  to 
sample  rotation  shows  the  anisotropy  of  chain  orientation  in  x-y  plane.  While  scanning  parallel 
to  the  nanograting  direction  XRD  detects  chain  ordering  corresponding  to  the  lattice  parameter  a 
i.e.  side  chain  faces  to  x-direction  as  shown  in  Fig.  5.a.ii.  and  while  scanning  perpendicular  to  the 
nanograting  direction  XRD  detects  chain  ordering  corresponding  to  the  lattice  parameter  b  i.e.  p- 
stacking  faces  to  x  direction  as  given  in  Fig.  5.a.iii.  This  only  means  that,  1)  7i-stacking  direction 
corresponding  to  lattice  parameter  b  is  along  the  line  grating  direction  parallel  to  the  substrate,  2) 
the  lattice  parameter  a  is  perpendicular  to  the  line  grating  direction  and  parallel  to  the  substrate, 
and  3)  the  lattice  parameter  c  is  normal  to  the  substrate  or  the  chains  are  vertically  aligned. 

Further  to  investigate  the  distribution  of  anisotropic  chain  orientation  nanogratings,  an 
azimuthal  ((j))  in  plane  prefer  orientation  scan  with  detector  fixed  at  an  angle  20%  5.2° 
corresponding  to  (100)  peak  was  perfonned.  The  sample  was  rotated  from  -40°  to  115°  i.e.  (j) 
goes  from  -40°  to  115°.  The  peak  maximum  was  observed  when  the  angle  between  the  source 
and  nanograting  is  about  3°.  It  shows  that  a-axis  orientation  centered  perpendicular  to  the  line 
grating  direction  and  the  distribution  ranges  from  -20  and  +20  which  is  shown  in  the  Fig.  10. 

Ordering  of  P3HT  chains  inside  nanostructures  during  nanoimprint  can  be  assesed  by 
considering  several  fundamental  theories  from  literature.  There  are  three  underlying  reason 
behind  the  ordering  of  P3HT  chains  in  nanostrucures:  1)  Flow  induced  ordering  into  nanoscale 
dimensions  at  molten  state  under  high  pressure;  2)  Nanoscale  confinement  of  polymer  and 
hydrophobic  interaction  between  side  walls  of  the  Si  mold  and  P3HT  side  chains  due  to  surface 
treatment  effects;  3)  Self  assemby  of  polymer  due  to  side  chain-side  chain  interaction  and 
interaction.  Polymer  disentanglment  commences  with  the  application  of  heat  during  nanoimprint, 
followed  by  flow  in  the  nanoscale.  As  this  process  continues,  polymer  chains  start  aligning  along 
the  flow  direction,  and  most  prominently  along  the  sidewalls  of  the  nanostrucures.  At  high 
temperatures  (170  C  in  our  experiments),  individual  molecular  chains  rotate  freely  along  the 
backbone,  with  the  hydrophobic  side  chains  preferably  turning  towards  the  FDTS  treated  side 
walls  of  the  nanostrucures  on  the  mold.  Thus,  crystallization  begins  when  the  hydrophobic  side 
chains  of  P3HT  interact  with  the  sidewalls  of  the  mold,  and  this  effect  is  cascaded  throughout  the 
imprinted  P3HT.  This  cascade  effect  continues  throughout  theduration  of  the  imprint,  while 
reaching  completion  as  the  temperature  is  cooled  down  to  room  temperature.  The  chains  cease  to 
move  as  the  temperature  is  reduced,  fonning  comb-like  polymeric  structures.  Finally,  the  rod¬ 
like  segments  are  ordered  in  a  parallel  lamella-type  stack  with  interdigitized  alkyl  side  chains  to 
achieve  maximum  van  der  Waals  interactions.  Therefore  the  jt-jt  interchain  interactions  and  the 
side  chain  hydrophobic  interactions  assist  in  formation  of  RR-P3Htcrystallites  and  hence  stable 
3-D  chain  alignment  is  possible.  The  effect  of  geometry  is  evident  in  the  case  of  nanogratings, 
which  exhibit  preferential  orientation  of  7i-stacking  along  the  nanograting.  The  less  intense  (100) 
peak  for  nanopillar  structures  compared  to  the  parallel  orientation  of  nanograting  by  in-plane 
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measurement  (Fig.  9)  indicates  that  the  P3HT  side  chains  in  naopillar  structures  turn  around  the 
walls  of  nanopores  and  hence  becomes  isotropic  in  the  x-y  plane.  Since  most  of  the  P3HT  chains 
which  were  edge  on  oriented  in  P3HT  film  are  reordered  to  c-axis  nonnal,  the  out  of  plane  (100) 
peaks  for  nanograting  and  nanopillar  structures  are  less  intensive  than  that  of  the  thin  film  as 
shown  in  Fig.  8.  However,  the  presence  of  (100)  peak,  though  less  intensive,  indicates  the 
presence  of  highly  crystalline  edge  on  oriented  P3HT  chain. 


The  gaussian-like  curve  in  figure  10  shows  the  distribution  of  alignment  modes  in  the 
nanogratings.  The  distribution  ranges  from  -20°  to  +  20°  which  means  that  the  7i-stacking  are 
tilting  up  to  20°  with  the  side  wall  on  both  sides.  Highly  ordered  polymer  chain  induced  by 
polymer  flow  into  nanostructures  and  nanoconfinement  with  in  hydrophobic  surface  has  been 
clearly  observed.  Regio-random  polymer  chains  present  in  the  nanostructures  causes  the  kink  or 
chain  folding  so  that  the  continuous  crystalinity  breaks  to  form  crystal  domains  with  amorphous 
boundries.  Therefore  nanoimprinting  was  proven  to  have  dominant  effect  on  polymer  chain 
alignment  and  the  nanoimprinting  conditions  such  as  time,  tempearture,  pressure  and  starting 
film  thickness  can  be  controlled.  Another  important  factor  is  the  geometry  effect  which  is  seen  in 
nanopillar  and  nanograting  structures. 

In  summary,  we  have  shown  that  P3HT  chain  ordering  is  significantly  impacted  by 
nanoconfinement  in  nanostrutured  geometries  during  nanoimprint  lithogprahy.  The  extent  and 
morphology  of  chain  ordering  can  be  controlled  by  mold  geometry  as  well  as  nanoimprint 
process  conditions.  The  mechanism  of  ordering  and  alignment  is  discussed  and  is  found  to  be 
supported  by  fundamental  studies  conducted  by  several  authors.  Preferential  chain  alignment 
and  control  of  3-D  morphology  was  achieved  and  clearly  examined  by  XRD.  The  resulting 
aligned  P3HT  nanostrucures  are  believed  to  offer  superior  charge  transport  properties  in  both 
solar  cell  and  FET  applications.  We  believe  that  the  n-n  stacking  along  the  nanograting  would 
enhance  mobility  FETs  while  the  vertical  (c-axis  normal)  alignment  and  n-n  stacking  across  the 
nanograting  would  facilitate  enhanced  charge  separtation  in  solar  cells.  Further  enhancement  of 
polymer  alignment  and  crsytallinity  can  be  achieved  by  fine-tuning  imprint  conditions  as  well  as 
polymer  quality. 


III.  Dimension  reduction  of  gratings  by  controlled  reflow  of patterned  polymer  mask 
Krutarth  Trivedi  and  Walter  Hu 

In  this  project,  we  have  also  developed  a  simple  and  well  controlled  process  of  dimension 
reduction  to  single-digit  nanometer  regime  by  controlled  reflow  of  lithographically  (nanoimprint 
lithography)  defined  polymer  lines  to  reduce  the  slope  of  the  grating  sidewall,  followed  by 
oblique  angle  metal  evaporation  and  inductively  coupled  plasma  (ICP)  etching  (Fig.  1 1).  This 
low-temperature  method  offers  good  dimensional  control  and  is  applicable  to  a  wide  variety  of 
materials.  One  application  to  make  nanoimprint  molds  with  narrow  trenches  (-T0  nm)  is 
demonstrated.  First,  50K  PMMA  is  imprinted  with  a  mold  containing  a  large  area  of  100  nm 
line  and  space  gratings  (Nanonex).  Then,  the  sample  is  heated  above  the  glass  transition 
temperature  of  PMMA,  causing  partial  melting  of  the  imprinted  PMMA  gratings  while  still 
preserving  the  overall  imprinted  profile.  Extent  of  melting,  characterized  by  the  change  in 
sidewall  slope  of  imprinted  lines,  can  be  well  controlled  by  duration  (Fig.  12)  and  temperature  of 
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heating.  Then,  a  metal  layer  is  deposited  onto  the  sloped  PMMA  gratings  by  oblique  angle 
evaporation.  By  adjusting  the  angle  during  metal  evaporation,  it  is  possible  to  control  the  size  of 
the  opening  in  the  metal  coating  on  the  imprint  profile  (Fig.  13).  The  metal  serves  as  a  mask  for 
etching  of  exposed  PMMA,  between  openings  in  the  metal,  by  oxygen  ICP  as  well  as  transfer  of 
patterns  to  the  oxide  layer  by  ICP  etching  in  a  mixture  of  C4F8,  CHF3  and  Ar.  Using  this 
process,  the  trench  width  can  be  precisely  controlled  down  to  as  small  as  10  mn  or  less  (reduced 
from  original  100  mn  line  and  space  PMMA  gratings)  and  resulting  patterns  exhibit  excellent 
quality  and  controllability  (Fig.  14). 
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Figures  and  Tables: 


Figure  1:  Nano-channel  OLED  process  flow  a)  Si  mold  is  pressed  into  the  SU-8  film  on  ITO 
coated  glass;  b)  the  mold  is  removed,  leaving  imprinted  SU-8  gratings;  c)  Cr  is  coated  on  top  of 
the  imprinted  SU-8  gratings  at  an  oblique  angle;  d)  the  residue  is  etched  by  CE  plasma,  with  the 
Cr  protecting  the  SU-8  gratings;  e)  the  imprinted  SU-8  gratings  are  isolated  after  O2  plasma  etch; 
f)  organics  and  metal  are  deposited  by  thermal  evaporation. 
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Figure  2:  SEM  micrographs  showing  cross  sections  of  a)  the  Si  mold  used  to  imprint  the  SU-8; 
b)  imprinted  SU-8  gratings  after  oblique  angle  Cr  evaporation;  c)  isolated  SU-8  gratings  with  Cr 
on  top  after  O2  plasma  etch;  d)  a  typical  nano-channel  OLED  device  after  thermal  evaporation  of 
organics  and  metal. 
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Figure  3:  Electrical  characterization  data  showing  a)  current  density  vs.  voltage  characteristics; 
and  b)  luminescence  vs.  voltage  characteristics  of  microscale  and  nano-channel  OLED  devices. 
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Figure  4:  Surface  potential  micrographs  with  insets  showing  light  emission  profile  under  optical 
microscope  of  a)  2  pm  grating;  b)  5  pm  grating;  c)  10  pm  grating;  and  d)  nano-channel  OLED 
devices. 


Table  1 


Grating  Dimension 

Average  ASP 

10  pm  gratings 

64  mV 

5  pm  gratings 

63  mV 

2  pm  gratings 

68  mV 

100  nm  gratings 

65  mV 
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Figure  5:  Schematic  of  nanoimprinting  process  and  P3HT  chain  alignment  during  flow  into 
nano-gratings  of  the  mold. 
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Figure  6:  SEM  top  view  of  Si  molds  a)  nanogratings  of  period  200  nm,  width  65  nm  and  depth 
200  nm  b)  hexagonal  array  of  nanopores  of  diameter  80  nm  and  height  350  nm,  separation  of  30 
nm.  Insets  show  cross  sectional  views.  Scale  bar  200  nm. 
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Figure  7:  SEM  images  of  45°  tilted  view  of  P3HT  a)  nanogratings  of  period  200  nm,  width  65 
nm,  height  200  nm  and  the  residual  layer  of  20  nm,  b)  hexagonal  array  of  nanopillars  of  diameter 
80  nm  and  height  150  nm,  separation  of  30  nm,  with  residual  layer  of  20  nm.  Insets  show  cross 
sectional  views.  Scale  bar  is  2  pm  and  200  nm  for  the  tilted  view  and  insets,  respectively. 
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Figure  8:  a)  Schematic  of  i)  out  of  plane  measurement  with  fixed  source  (S)  and  detector  (D) 
moving  in  the  vertical  plane,  ii)  edge  on  orientation  iii)  face  on  orientation  for  the  detection  of 
lattice  in  z  direction  b)  out  of  plane  XRD  measurement  for  the  detection  of  lattice  normal  to  the 
substrate.  The  detection  of  edge  on  orientation  conforms  edge  on  chain  alignment  of  P3HT 
structures  for  thin  film  (blue),  nanogratings  (black)  and  nanopillar  (red).  The  decrease  in  (100) 
peak  intensity  indicates  the  rearrangement  of  P3FIT  chain  due  to  nanoimprinting. 
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Figure  9:  a)  Schematic  of  XRD  i)  in-plane  measurement  with  fixed  source  (S)  and  detector  (D) 
moving  in  the  horizontal  plane;  detection  of  ii)  (100)  and  iii)  (010)  lattice  orientation  on  x- 
direction,  b)  out  in-plane  XRD  measurement  for  the  detection  of  lattice  along  x-direction.  Red 
curve  for  the  nanograting  when  aligned  along  y-direction  and  green  for  that  when  aligned  along 
x-direction,  blue  is  for  nanopillars  and  black  for  the  film.  The  difference  between  red  and  green 
curve  conforms  the  preferential  chain  orientation  of  P3HT  chain  in  nanograting  as  shown  in 
schematic  a). 
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Figure  10:  XRD  in-plane  <|)  scan  measurement  for  the  grating  structures.  Inset  shows  the 
schematic  of  the  measurement.  The  sample  is  placed  so  that  when  4>=0°  the  sample  is  aligned 
with  d=0°  cross  hair  position  as  shown  and  the  nanograting  become  parallel  to  the  incident  beam. 
The  incident  beam  makes  an  angle  d  with  the  direction  of  nanograting  when  the  thin  film  stage 
rotated  through  an  angle  ()>■  The  detector  is  fixed  at  2  0  =  5.2°. 


18 


Figure  1 1 :  Schematic  showing  imprinted  (a),  and  then  melted  PMMA  lines  (b),  oblique  angle 
metal  evaporation  on  melted  lines  (c)  and  ICP  etch  to  substrate  (d). 


Heating  Time  (minutes) 


Figure  12:  SEM  X-section  showing  imprint  profiles  with  measured  sidewall  slope  angle  before 
(a)  and  after  heat  treatment  at  125°C  for  30  seconds  (b),  1  minute  (c),  and  2  minutes  (d)  (scale 
bar  is  shared);  (e)  Sidewall  slope  angle  of  imprinted  gratings  heated  at  125°C  as  a  function  of 
heating  time. 


Metal  Evaporation  Angle  (°) 


Figure  13:  SEM  X-section  showing  imprinted  profiles  after  oblique  angle  Cr  evaporation  on 
gratings  without  heat  treatment  (a)  and  with  heat  treatment  (b);  (c)  Etched  trench  dimension  in 
oxide  as  a  function  of  metal  evaporation  angle. 


Figure  14:  SEM  X-section  showing  patterns  etched  into  oxide  after  oblique  angle  Cr  evaporation 
on  melted  imprint  profile  at  10°  (a),  20°  (b)  and  30°  (c). 
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Final  Report 


SPECTROSCOPY  OF  ELECTRONIC  EXCITATIONS  OF  CARBON 
NANOTUBES  IN  POLAR  ENVIRONMENTS 

UTD-SPRIN G / AFOSR  Project 

PI:  Dr.  Yuri  Gartstein  (Department  of  Physics) 


I.  Overiview 

The  scope  of  our  work  in  this  Project  has  been  theoretical  studies  and  modeling  of  charged 
and  neutral  excitations  of  carbon  nanotubes  and  related  one-dimensional  (ID)  semiconductor 
nanostructures  (such  as  nanowires  and  conjugated  polymers)  as  affected  by  surrounding 
polarizable  environments.  A  particular  focus  has  been  on  the  effect  of  solvation  of  excess 
charge  carriers  by  the  slow  (orientational)  polarization  of  the  surrounding  polar  solvent.  This 
is  the  situation  of  particular  relevance  to  applications  and  processes  involving  fundamental 
redox  reactions,  including  (photo)electrochemistry.  It  should  be  noted  that  redox  chemistry 
of  carbon  nanotubes  has  been  deemed  in  the  literature  as  “emerging  held  of  nanoscience” 
and  solvatochromic  effects  are  being  increasingly  researched.  As  a  result  of  solvation,  excess 
charge  carriers  can  become  self-localized  forming  large-radius  ID  adiabatic  polarons  whose 
properties  are  distinctly  different  from  the  parent  band  electronic  states.  The  energetic 
significance  of  the  solvation  has  been  emphasized  by  hireling  that  the  binding  energy  of  the 
resulting  ID  polarons  could  reach  a  substantial  fraction,  roughly  one  third,  of  the  binding 
energy  of  Wannier-Mott  excitons,  the  well-known  primary  photoexcitations  in  many  ID 
semiconductors.  This  may  lead  to  enhanced  charge  separation.  On  the  other  hand,  the 
mobility  of  solvated  charge  carriers  is  drastically  reduced  due  to  the  dissipative  drag  of  the 
medium.  Our  emphasis  was  on  clarifying  some  conceptual  issues  and,  mostly,  on  working 
out  experimentally  testable  signatures  of  such  solvation-induced  polarons. 

We  have  been  keeping  our  counterparts  at  Rice  University  abreast  of  our  findings  that 
could  serve  as  theoretical  expectations  for  future  experiments.  In  turn,  experimental  data 
from  Rice  on  the  effects  of  hydrocarbon  environments  on  excitonic  transition  in  carbon  nan¬ 
otubes  is  being  currently  employed  by  us  in  evaluations  of  the  dynamical  dielectric  effects 
on  electron-  and  hole-quasiparticles. 

1.  Personnel  involved 

The  following  personnel  received  various  amounts  of  financial  support  from  the  funds  pro¬ 
vided  by  this  grant:  (1)  Yuri  Gartstein  (PI);  (2)  Tony  Bustamante  (undergraduate  student); 
(3)  Michael  Mayo  (graduate  student);  (4)  Geoffrey  Ussery  (graduate  student). 

Due  to  the  mismatch  of  the  project  period  endings  (April  30)  and  the  official  UTD 
semester  duration  dates,  no  student  support  during  Spring  semesters  was  possible  to  ar¬ 
range.  This  resulted  in  leftover  personnel  support  funds,  which  could  be  used  to  support  the 
in-progress  research,  should  a  no-cost  extension  be  granted. 

2.  Student  training  and  development 

All  students  involved  in  the  project  have  been  receiving  training  expanding  their  knowl¬ 
edge  of  related  physical  and  chemical  aspects  of  the  problems.  No  less  important,  they 
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substantially  improved  their  practicing  skills,  both  conceptual  and  computational,  and  had 
multiple  opportunities  to  appreciate  the  synergy  of  ideas  and  methods  characteristic  of  mod¬ 
ern  nanoscience. 

Their  research  within  the  framework  of  the  project  has  also  resulted  in  important  personal 
development  steps  as  signified  in  submitted  theses/proposals: 

•  Tony  Bustamante,  Senior  Honors  Thesis  “Environmental  and  Geometrical  Effects  on  Excitons 
in  Carbon  Nanotubes”  defended  in  2007. 

•  Geoffrey  Ussery,  PhD  Thesis  Proposal  “Physical  properties  of  solvated  charge  carriers  on 
one-dinrensional  semiconductors”  submitted.  PhD  defence  is  expected  in  the  Summer  2009. 

•  Michael  Mayo,  PhD  Thesis  Proposal  “Model  ab  initio  study  of  excess  charge  carrier  solvation 
on  conjugated  carbon  chains”  submitted.  PhD  defence  is  expected  in  the  Fall  2009. 

3.  Peer-reviewed  publications  of  the  completed  work 

The  following  papers  have  already  been  published  acknowledging  support  from  UTD- 
SPRING  Research  and  Nanotechnology  Transfer  Program: 

•  Yu.N.  Gartstein,  T.  D.  Bustamante  and  S.  Ortega  Castillo,  “Polarons  and  excitons  on  a  cylin¬ 
der:  a  simplified  model  for  nanotubes  in  polar  environments” ,  Journal  of  Physics:  Condensed 
Matter  19,  156210  (2007). 

•  Yu.N.  Gartstein  and  V.M.  Agranovich,  “Excitons  in  long  molecular  chains  near  the  reflecting 
interface”,  Physical  Review  B  76,  115329  (2007). 

•  Yu.N.  Gartstein  and  G.L.  Ussery,  “One-dinrensional  semiconductor  in  a  polar  solvent:  Sol¬ 
vation  and  low-frequency  dynamics  of  an  excess  charge  carrier”,  Physics  Letters  A  372,  5909 
(2008). 

•  M.L.  Mayo  and  Yu.N.  Gartstein,  “Charge  carrier  solvation  and  large  polaron  formation  on 
a  polymer  chain  revealed  in  model  ab  initio  computations”,  Physical  Review  B  78,  073402 
(2008). 

•  G.L.  Ussery  and  Yu.N.  Gartstein,  “Optical  absorption  from  solvation-induced  polarons  on 
nanotubes”,  Journal  of  Chemical  Physics  130,  014701  (2009). 

•  M.L.  Mayo  and  Yu.N.  Gartstein,  “Model  ab  initio  study  of  excess  charge  carrier  solvation 
and  large  polaron  formation  on  conjugated  carbon  chains” ,  Journal  of  Chemical  Physics  130, 
134705  (2009). 

4.  Presentations  of  the  completed  work 

Scientific  results  of  the  project  have  been  presented  at  several  venues: 

•  Yuri  Gartstein  and  Tony  Bustamante,  “Nanotubes  in  Polar  Media:  Polarons  and  Excitons 
on  a  Cylinder”,  UTD-Technion  Workshop;  UTD,  January  2007 

•  Yuri  Gartstein  and  Tony  Bustamante,  “Nanotubes  in  polar  environments:  polarons  vs  exci¬ 
tons”  ,  Oral  presentation  at  the  7th  International  Conference  on  Optical  Probes  of  Conjugated 
Polymers  and  Functional  Self-Assemblies;  Turku,  Finland,  June  2007 

•  Geoffrey  Ussery  and  Yuri  Gartstein,  “Nanotubes  in  polar  environments:  Solvated  charge 
carriers  and  their  dynamics”,  APS  March  Meeting;  New  Orleans,  March  2008 

•  Michael  Mayo  and  Yuri  Gartstein,  “Charge  carrier  solvation  on  a  polymer  chain  revealed  in 
ab  initio  computations”,  APS  March  Meeting;  New  Orleans,  March  2008 

•  Michael  Mayo  and  Yuri  Gartstein,  “Model  ab  initio  studies  of  solvation  and  excess  charge 
localization  on  conjugated  carbon  chains”,  APS  March  Meeting;  Pittsburg,  March  2009 
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•  Geoffrey  Ussery  and  Yuri  Gartstein,  “Nanotubes  in  polar  solvents:  Solvation  and  physical 
properties  of  excess  charge  carriers”,  2009  Spring  Meeting  of  the  Texas  Sections  of  the  APS, 
AAPT,  and  SPS;  Stephenville,  Texas,  April  2009 

5.  The  work  in  progress  and  future  plans 

In  addition  to  the  completed  work  cited  above,  there  is  an  intense  ongoing  work  that 
will  continue  until  an  acceptable  level  of  fruition  is  achieved.  The  following  tasks  are  being 
currently  addressed: 

•  Electron  transfer  between  ID  semiconductors  (nanotubes)  and  small  molecular  species. 
Classic  descriptions  of  the  electron  transfer  as  applied  to  chemical  reactions  in  solutions 
and  to  electrochemistry  include  Marcus  theory  of  the  transfer  between  small  molecular 
species  and  Marcus- Gerischer  model  of  the  transfer  to  bulk  semiconductor  electrodes. 
In  the  former,  the  solvent  fluctuations  affect  the  discrete  electronic  levels  of  both  con¬ 
stituents  while  in  the  latter,  the  band  structure  of  the  semiconductor  is  considered  rigid 
thereby  represented  by  the  bare  density  of  electronic  states.  The  strong  solvation  effect 
we  have  studied  for  excess  charge  carriers  on  ID  semiconductors  requires  a  description 
“in  between”  those  two  as  the  solvent  fluctuations  “smear”  the  rigid  electronic  struc¬ 
ture  and  produce  localized  molecular  orbitals  with  intragap  energies.  We  are  working 
on  the  corresponding  theoretical  development. 

•  Ab  initio  studies  of  solvation.  In  order  to  better  evaluate  the  role  of  many-electron 
effects,  we  are  pursuing  a  systematic  comparative  study  of  polaronic  effects  at  different 
levels  of  the  theory.  Compared  will  be  various  hybrid  DFT  and  pure  Hartree-Fock 
computations.  Also  under  consideration  is  application  of  time-dependent  PCM  schemes 
dubbed  recently  “a  new  frontier  for  quantum-mechanical  continuum  models” .  We  also 
explore  types  of  charged  excitations  other  than  simple  polarons,  such  as  kink-solitons 
and  bipolarons. 

•  Modulation  of  the  exciton  energies  in  carbon  nanotubes  by  various  dielectric  environ¬ 
ments.  The  observable  exciton  energy  is  determined  by  both  the  gap  energy  for  in¬ 
dividual  electron  and  hole  quasi-particles  and  their  binding  energy.  We  are  currently 
evaluating  the  effects  on  quasi-particle  excitations  that  result  from  the  fast  polarization 
components  in  order  to  compare  them  to  corrections  from  the  exchange  interactions 
previously  discussed  in  the  literature.  Also  of  interest  here  is  the  modulation  of  the 
excitonic  effects  by  vibrational  modes  of  the  environment  whose  energies  are  close  to 
the  exciton  binding  energy. 

We  believe  the  scope  of  our  studies  is  of  relevance  and  potential  importance  for  various 
applications  of  carbon  nanotubes  and  related  ID  nanostructures.  We  intend  to  use  results 
derived  during  this  project  to  apply  for  further  funding  opportunities  to  be  able  to  continue 
this  line  of  research.  We  have  already  submitted  one  proposal  entitled  “Quantum  chemistry 
of  electron  solvation  on  conjugated  polymers”  (Welch  Foundation),  and  more  is  in  the  plan¬ 
ning  stage. 

II.  Synopsis  of  some  scientific  results 

A  comprehensive  discussion  of  our  results  is  available  on  the  following  pages.  In  this 
section  we  attempt  to  briefly  introduce  and  comment  on  some  of  the  results. 
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•  Comparing  polarons  and  excitons:  the  energetic  significance  of  the  effect  of  solvation. 
Within  a  simplified  model  of  charge  carriers  on  semiconducting  nanotubes  in  sluggish 
polar  environments  (such  as  common  solvents),  we  compared  the  binding  energies  E^xc 
of  the  Wannier-Mott  exciton  (a  closely  bound  electron-hole  pair)  and  of  the  polaron, 
E^°\  Our  variational  study  demonstrated  a  non-monotonic  dependence  of  the  ratio  of 
the  binding  energies  EfoX / Etffic  on  the  radius  of  the  nanotubes  and  that  this  ratio  may 
reach  values  as  large  as  about  0.35.  As  the  exciton  dissociation  under  these  conditions 
would  occur  into  a  polaron  pair,  the  thermal  activation  energy  Ef*c  —  2 Ef°l  for  the 
dissociation  is  thereby  decreased  by  a  factor  of  about  3  compared  to  the  bare  exciton 
binding,  evidently  a  very  substantial  effect.  It  is  expected  that  this  should  result  in 
enhanced  separation  of  charge  carriers  and  could  be  an  attractive  proposition  for  ap¬ 
plications  requiring  both  strong  excitonic  transitions  and  the  relative  ease  of  exciton 
dissociation.  We  believe  we  elucidated  the  basic  physics  behind  this  behavior  that  we 
characterized  as  the  crossover  effect  occurring  in  systems  crossing  over  “between'7  pure 
3D,  2D  or  ID.  Hence,  we  conjecture  the  relevance  for  other  semiconducting  nanos¬ 
tructures  (quantum  wells,  quantum  wires)  in  contact  with  polar  environments  and  the 
possibility  to  judiciously  exploit  other  geometries.  Another  generic  implication  is  the 
possibility  to  influence  some  excitations  more  than  the  other  by  geometrical  means, 
that  is,  by  selectively  matching  the  geometry  to  the  size  of  the  desired  excitation. 

•  Coherent  optical  properties  of  exciton-polaritons  in  ID  molecular  systems  in  the  vicinity 
of  a  metallic  substrate,  an  environment  characterized  by  a  strong  and  fast  polarization. 
In  this  study,  we  focus  on  a  more  tractable  model  of  (small-radius)  Frenkel  excitons. 
With  a  proper  modification  for  specifics,  however,  results  are  expected  applicable  to 
various  systems  including  nanotubes  and  quantum  wires.  The  metallic  substrate  acts 
as  to  modify  the  inherent  optical  properties,  hence  the  systems  of  this  composition 
may  be  of  relevance  for  optoelectronic  and  plasmonic  applications.  Our  results  indeed 
showed  a  strong  modification  of  patterns  of  the  radiative  decay  and  elucidated  the 
dependence  of  the  effects  on  the  polarization  and  energy  of  the  excitons  as  well  as 
on  the  distance  from  the  interface.  A  major  channel  of  the  decay  has  been  found  to 
be  the  excitation  of  surface  plasmons  of  the  substrate.  In  an  idealized  system,  the 
exciton-polaritons  form  two  branches  in  their  spectrum:  one  is  “super-radiant”  with 
respect  to  surface-plasmon  emission  and  the  other  is  non-emissive  representing  bound 
exciton-surface-plasmon  states.  Taking  into  account  the  realistic  dissipation  of  in  the 
substrate  modifies  the  idealized  picture  in  two  regards.  First,  as  the  wave  vector  of 
surface  plasmons  acquires  a  finite  uncertainty,  there  is  no  clean-cut  separation  into 
emissive  and  non-emissive  branches.  Second,  the  excitons  can  decay  in  a  non-coherent 
way  via  Joule  losses  in  the  metal.  The  results  suggest  that  one  could  control  the  relative 
importance  of  the  processes  by  the  variation  of  the  distance  to  the  substrate. 

•  Mobility  and  low-frequency  dynamics  of  polarons.  A  simplified  dynamical  model  has 
been  developed  that  allowed  to  study  the  formation  and  low-frequency  properties  of  po¬ 
larons  as  solvated  charge  carriers.  The  model  embarks  from  the  combined  microscopic 
dynamics  of  the  excess  electron  charge  density  and  the  surrounding  Debye  solvent  to 
result  in  a  macroscopic  Langevin  dynamics  of  the  polaron  center  of  mass.  This  way 
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we  have  been  able  to  derive  a  closed-form  solution  for  such  an  important  character¬ 
istic  as  polaron  mobility  and  find  its  functional  dependence  on  the  properties  of  the 
semiconductor  and  the  solvent.  Importantly,  numerical  estimates  indicate  that  polaron 
mobilities  can  be  orders  of  magnitude  lower  than  the  intrinsic  mobilities.  We  have  also 
identified  local  solvent  dielectric  relaxation  modes  forming  around  the  polaron  that 
may  play  a  role  in  the  low-frequency  absorption. 

•  Optical  absorption  from  polarons.  Another  testable  experimental  signature  of  polarons 
is  the  higher-frequency  (infrared)  absorption  from  a  local  polaron  level.  We  performed 
calculations  of  such  absorption  based  on  a  simplified  model  of  solvated  carriers  on  nan¬ 
otubes.  The  calculations  show  that  over  a  wide  range  of  system  parameters,  the  major 
contribution  (oscillator  strength  of  about  0.85-0.9)  to  the  absorption  corresponds  to 
the  promotion  of  a  charge  carrier  to  the  next  local  level  (rather  than  to  the  band 
states)  within  the  polarization  potential  well.  The  transition  energy  was  evaluated  as 
a  function  of  system  parameters  that  should  assist  in  prospective  experimental  studies. 
We  have  also  shown  that  thermal  fluctuations  of  the  environment  result  in  a  substan¬ 
tial  (order-of-magnitude  larger  than  the  thermal  energy)  broadening  of  this  transition. 
Gross  features  of  the  resulting  absorption  line  shapes  have  been  illustratively  calcu¬ 
lated. 

•  Ab  initio  studies  of  excess  charge  carrier  solvation.  Using  state-of-the-art  hybrid  DFT 
calculations  in  conjunction  with  the  Polarizable  Continuum  Model  (PCM)  as  imple¬ 
mented  in  Gaussian  03,  we  have  been  able  to  accomplish  a  first  -  to  the  best  of  our 
knowledge  -  ab  initio  demonstration  of  the  solvation-induced  self-consistent  localization 
of  the  excess  charge  leading  to  ID  large-radius  polaroninc  states.  As  these  computa¬ 
tions  are  quite  demanding,  this  demonstration  relied  on  conceptually  prototypical  but 
structurally  simpler  systems  of  conjugated  polyynic  carbon  chains.  Both  electron-  and 
hole-polarons  have  been  shown  to  be  emerging  entirely  due  to  solvation  but  even  larger 
degrees  of  charge  localization  can  occur  when  accompanied  by  carbon  atom  displace¬ 
ments.  Ab  initio  computations  are  expected  to  help  in  clarification  of  the  role  played 
by  many-electron  interactions,  and  in  fact,  our  results  indicate  the  reorganization  of 
valence  band  electrons  taking  place  upon  the  polaron  formation.  From  a  theoretical 
standpoint,  our  demonstration  can  also  be  thought  of  as  a  new  avenue  for  applications 
of  quantum-mechanical  continuum  solvation  models. 
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Abstract 

Excess  charge  carriers  on  semiconducting  nanotubes  immersed  in  sluggish 
polar  environments  can  undergo  self-localization  into  polaronic  states.  Using  a 
simplified  model  of  electrons  and  holes  of  equal  effective  masses  and  confined 
to  a  cylindrical  surface  in  the  three-dimensional  polar  medium,  we  evaluate  the 
binding  energy  E£0  of  adiabatic  Frohlich-Pekar  polarons  and  compare  it  to  the 
corresponding  exciton  binding  energy  The  ratio  Zs£ol/£“c  is  found  to  be  a 
non-monotonic  function  of  the  cylinder  radius  R  which,  in  an  idealized  model, 
can  reach  values  of  about  0.35,  substantially  larger  than  values  of  about  0.2  for 
two-dimensional  (2D)  or  three-dimensional  (3D)  systems.  We  argue  that  these 
findings  represent  a  more  general  crossover  effect  that  could  manifest  itself  in 
other  semiconductor  nanostructures  in  3D  polar  environments.  As  a  result  of  the 
strong  polaronic  effect,  the  activation  energy  of  exciton  dissociation  into  polaron 
pairs  is  significantly  reduced,  which  may  lead  to  enhanced  charge  separation. 


1.  Introduction 

Low-dimensional  semiconductor  structures  such  as  quantum  wells,  quantum  wires,  nanotubes 
and  conjugated  polymers  are  important  for  practical  applications  and  interesting  scientifically. 
It  is  known  that  the  confinement  of  the  motion  of  charge  carriers  in  some  directions  leads 
to  increased  effects  of  the  Coulomb  interaction  on  system  excitations.  In  this  paper  we  are 
concerned  with  two  types  of  such  effects:  excitonic  and  polaronic.  The  excitonic  effect 
refers  to  the  formation  of  Coulombically  bound  electron-hole  pairs,  Wannier-Mott  excitons, 
which  progressively  affect  optical  properties  of  semiconductors:  the  exciton  binding  energy 
£“c  increases  from  its  3D  value  to  2D  and,  further  on,  to  quasi-one-dimensional  (ID) 
magnitudes  ( £jjxc  diverges  in  pure  ID)  [1],  The  polaronic  effect  occurs  in  polar  media, 
where  the  Coulomb  field  of  an  individual  charge  carrier  causes  the  polarization  (deformation) 
of  its  surroundings,  resulting  in  the  carrier  self-localization  into  polaronic  states.  Such 
polarons  have  been  extensively  studied  especially  in  the  context  of  3D  ionic  crystals  and  polar 
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semiconductors  [2-5].  The  polaronic  effect  also  grows  with  the  confinement:  the  polaron 
binding  energy  Zi^’01  in  2D  is  larger  than  in  3D  and  would  diverge  in  pure  ID  [6,  7].  Multiple 
aspects  of  the  excitonic  and  polaronic  effects  have  been  explored  in  many  publications  for 
various  specific  low-dimensional  systems. 

Of  particular  interest  to  us  is  a  relationship  between  and  ,  each  of  the  binding 
energies  understood  as  being  measured  from  the  band  edges  in  the  absence  of  the  polaronic 
effect.  The  ratio  E^  /  E™c  of  the  binding  energies  has  a  clear  significance  for  the  relative 
energetics  of  closely  bound  and  well-separated  electron-hole  pairs  that  is  expected  to  affect 
practically  important  processes  of  charge  separation  and  recombination.  The  bare  value  E^xc 
signifies  the  ionization  energy,  whether  thermal  or  photo,  required  for  ‘unbinding’  of  the 
exciton  into  a  well-separated  band-edge  electron  and  hole.  In  the  presence  of  the  polaronic 
effect,  however,  the  thermal  ionization  (dissociation)  would  occur  into  a  distant  electron- 
polaron  and  hole-polaron  so  that  the  exciton  thermal  ionization  energy  is  reduced  from  E^xc 
to  E^xc  —  2E£o1.  We  are  interested  in  how  much  of  this  relative  reduction  might  be  possible 
to  achieve  due  to  the  formation  of  strong-coupling  (adiabatic)  polarons.  Our  discussion  here 
is  restricted  to  systems  with  equal  electron  and  hole  effective  masses,  me  =  mtl  =  m,  so 
that  the  electron-polaron  and  hole-polaron  have  the  same  binding  energies  while  the  closely 
bound  exciton  is  neutral  both  globally  and  locally  and  therefore  does  not  polarize  the  sluggish 
component  of  the  medium  in  the  adiabatic  approximation  [8,  9]. 

It  is  instructive  to  look  at  the  ratio  E^°l/E^xc  based  on  the  results  known  for  isotropic 
systems  of  ‘well-defined’  dimensionality,  that  is,  for  purely  3D,  2D  and  ID  systems.  One 
would  then  find  that,  while  each  of  the  binding  energies  increases  with  more  confinement,  their 
growth  occurs  nearly  ‘in  proportion’  so  that  the  ratio  does  not  change  significantly.  Indeed,  the 
classic  Pekar’s  result  [10]  for  3D  adiabatic  polarons  would  translate  into  the  maximum  ratio  of 
about  0.22.  The  exciton  binding  in  2D  increases  by  a  factor  of  four  from  its  3D  value  [1]  but 
the  polaron  binding  energy  in  2D  increases  by  almost  as  much  [6],  resulting  in  the  ratio  ~0.20. 
Moreover,  if  the  divergent  purely  ID  binding  energies  are  taken  (parametrically)  for  their  ratio, 
then  the  result  of  [7]  would  translate  into  a  maximum  Zs£ol/£“c  of  approximately  0.17. 

From  the  standpoint  of  this  data,  the  results  of  our  recent  model  quasi-ID  calculations  [11] 
for  polarons  and  excitons  on  nanotubes  immersed  in  a  3D  polar  medium,  yielding  E^X / £“c  in 
excess  of  0.3,  appear  quite  surprising.  In  the  particular  case  of  the  tubular  geometry,  charge 
carriers  are  confined  to  motion  on  a  cylindrical  surface.  Our  study  [11]  was  restricted  to 
relatively  small  cylinder  radii  R.  In  this  paper  we  will  use  a  direct  variational  approach  to 
calculate  adiabatic  polarons  for  arbitrary  R,  thereby  enabling  an  assessment  of  the  evolution  of 
the  ratio  E^a1/ £“c  between  the  purely  2D  ( R  — >  oo)  and  quasi-ID  regimes.  The  corresponding 
calculations  for  excitons  on  a  cylindrical  surface  have  been  performed  recently  [12,  13]  and  are 
very  much  in  line  with  our  exciton  data.  We  will  demonstrate  explicitly  (see  figure  2(b))  that 
the  ratio  of  the  polaron  and  exciton  binding  energies  exhibits  non-monotonic  behaviour  as  a 
function  of  the  cylinder  radius  and  can  achieve  values  as  large  as  about  0.35  at  intermediate 
R ,  where  the  cylinder  circumference  is  roughly  comparable  to  an  appropriate  Bohr  radius. 
We  believe  that  our  demonstration  of  El°l/E^xc  ratios  above  the  values  in  purely  3D  and  2D 
systems  can  be  rationalized  by  invoking  simple  physical  arguments.  These  arguments  also 
suggest  that  the  relative  increase  in  the  polaronic  effect  that  is  found  may  reflect  a  general 
behaviour  that  might  be  characterized  as  a  crossover  effect. 

Consider  a  gradual  increase  in  the  confinement,  e.g.  by  starting  to  decrease  the  radius  of 
a  very  large  cylinder  in  going  from  the  purely  2D  system  towards  quasi- ID  or  by  starting  to 
decrease  the  thickness  of  a  very  thick  quantum  well  in  going  from  the  purely  3D  system  towards 
2D.  Stated  simplistically,  the  spatial  size  (extent)  of  a  still  unconfined  polaron  wavefunction 
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is  larger  than  that  of  an  unconfined  exciton.  As  the  confinement  increases,  therefore,  the 
polaron  can  start  experiencing  substantial  growth  in  its  binding  energy  due  to  the  confinement 
before  a  ‘proportionally’  substantial  growth  of  the  exciton  binding  energy  sets  in.  With  yet 
further  increase  in  confinement,  both  polaron  and  exciton  binding  energies  will  reflect  fuller 
confinement  effects,  resulting  in  the  corresponding  trend  of  the  decreasing  ratio  ££ol/£™. 
Of  course,  quantitative  aspects  of  the  evolution  can  vary  for  different  systems  and  need  to  be 
evaluated  accordingly.  We  also  stress  that  the  above  consideration  tacitly  assumed  the  existence 
of  a  uniform  3D  polarizable  medium,  with  the  confinement  affecting  only  the  motion  of  charge 
carriers.  Strong  violations  of  this  assumption  can  significantly  affect  the  outcome  for  the  ratio 
of  binding  energies. 

While  serving  as  a  suggestive  illustration  of  possibly  general  behaviour,  it  is  polarons  and 
the  relationship  of  polarons  and  excitons  on  a  cylindrical  surface  that  are  the  subjects  of  our 
direct  interest  in  this  paper.  Organic  and  inorganic  tubular  (nano)structures  (see,  e.g.,  [14,  15]) 
attract  a  great  deal  of  attention  and  are  considered  to  be  candidate  systems  for  important 
applications  like  (photo)electrochemical  energy  conversion  and  the  (photo)catalytic  production 
and  storage  of  hydrogen,  as  well  as  in  optoelectronics.  On  the  one  hand,  their  extended 
size  along  the  tube  axis  can  facilitate  very  good  electron  transport  in  that  direction.  On  the 
other  hand,  tubes  can  expose  large  areas  of  both  exterior  and  interior  surfaces  to  facilitate 
surface-dependent  reactions.  Many  of  these  applications  involve  contact  with  polar  liquid 
environments  such  as  common  aqueous  and  non-aqueous  solvents  and  electrolytic  solutions 
which  can  provide  conditions  appropriate  for  the  strong  polaronic  effect  [11],  thereby  changing 
the  nature  of  charge  carriers.  A  widely  known  example  of  the  tubular  geometry  is  single-walled 
carbon  nanotubes  (SWCNTs)  and,  in  fact,  redox  chemistry  of  carbon  nanotubes  is  an  ‘emerging 
field  of  nanoscience’  [16].  We  note  that  the  importance  of  the  excitonic  effects  in  the  optics  of 
semiconducting  SWCNTs  is  well  established  now,  with  binding  energies  £“c  experimentally 
measured  in  some  SWCNTs  to  be  in  the  range  of  0.4-0. 6  eV  [17-19].  There  is  also  growing 
evidence  of  environmental  effects  on  the  electronic  properties  of  SWCNTs  [20-22]. 

It  can  be  anticipated  that  the  polaronic  effect  that  we  discuss  would  have  an  influence  on 
both  charge-transfer  reactions  and  charge  carrier  dynamics  on  the  tubes.  In  the  context  of  our 
discussion  of  the  relative  energetics,  a  substantial  reduction  in  the  activation  energy  due  to  the 
polaronic  effect  should  be  expected  for  electric-field-assisted  exciton  dissociation  and  charge 
separation  on  semiconducting  nanotubes. 

Our  illustrative  calculations  of  the  polaronic  effect  in  this  paper  do  not  take  into  account 
details  of  the  electronic  band  structure  and  polarizability  of  different  nanotubes  but,  instead,  are 
restricted  to  a  simplest  model  of  electrons  and  holes  confined  to  a  cylindrical  surface  in  the  3D 
dielectric  medium.  The  assumption  of  equal  electron  and  hole  effective  masses  is  a  reasonable 
approximation,  e.g.  for  SWCNTs  as  well  as  for  boron  nitride  tubes  [23]. 

2.  Exciton  and  polaron  energy  functionals 

As  noted  above,  our  basic  model,  following  references  [12]  and  [13],  assumes  that  electron  and 
hole  are  particles  of  the  same  effective  mass  m  whose  motion  is  restricted  to  the  surface  of  the 
cylinder  of  radius  R.  The  cylindrical  surface  itself  is  immersed  in  the  uniform  3D  dielectric 
continuum  characterized,  as  is  common  in  studies  of  the  polaron  [2-4]  and  solvation  [24-26] 
effects,  by  two  magnitudes  of  the  dielectric  permittivity:  the  high-frequency  (optical)  value 
of  €oo  and  the  low-frequency  (static)  value  of  es.  In  the  case  of  liquid  polar  media,  the  slow 
component  of  the  polarization  is  ordinarily  associated  with  the  orientational  polarization  of  the 
solvent  dipoles,  and  it  is  typical  [25,  11]  that  es  <?oo-  The  fast  component  of  the  polarization 
follows  charge  carriers  instantaneously.  The  slow  component  of  the  polarization,  on  the  other 
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hand,  is  considered  to  be  static  in  the  adiabatic  picture  to  determine  the  electronic  states;  the 
slow  component  then  responds  to  the  averaged  electronic  charge  distribution. 

With  me  =  m  h ,  there  is  no  net  charge  density  associated  with  the  ground  state  of  the 
neutral  exciton  that  would  cause  a  static  polarization  of  the  slow  component  (‘non-polarizing 
exciton’).  The  Coulomb  interaction  between  the  electron  and  the  hole  in  the  exciton  is  screened 
only  by  the  high-frequency  dielectric  response  €oo  [8,  9,  27].  The  total  energy  of  the  exciton  as 
a  function  of  the  normalized  wavefunction  of  the  electron-hole  relative  motion  (reduced 
mass  m/2)  is  then 


£exc{if]  =  Ke 


ue 


=-[ 


dr  |Vi/r(r)|' 


—  Sir 

Cx>  J 


IVKr)|- 
D(  r) 


(1) 


whose  global  minimum  determines  the  exciton  ground-state  wavefunction  and  its  binding 
energy.  For  our  geometry,  position  vector  r  =  (jc.v)  is  confined  to  the  cylindrical  surface, 
where  we  choose  x  to  be  along  the  cylinder  axis  and  —  jtR  <  y  <  tt  R  to  be  along 
the  circumferential  direction;  |Vi/f(r)|2  =  (difr/dx)2  +  {dijs/dy)2.  Coulomb  interaction 
is  determined  by  the  physical  distance  in  the  3D  space;  in  the  flat  geometry  it  would  be 
Z)(r)  =  |r|,  for  the  points  on  the  cylindrical  surface 

D(r)  =  (x2  +  4R2  sin2  .  (2) 


Optimization  of  the  functional  (1)  for  the  ground  state  of  the  exciton  on  a  cylinder  has  been 
performed  in  [12]  and  [13],  with  the  results  being  in  very  good  agreement  with  our  data  to  be 
used  in  the  comparison  with  the  polaron. 

The  polarons  that  we  discuss  here  are  of  the  large-radius  Frohlich-Pekar  type,  where  the 
Coulomb  field  of  an  individual  charge  carrier  supports  a  self-consistent  dielectric  polarization 
pattern  surrounding  the  carrier.  The  formation  of  large-radius  polarons  (self-localization, 
self-trapping)  occurs  due  to  the  interaction  of  charge  carriers  with  the  ‘slow’  component  of 
polarization.  As  the  fast  component  of  polarization  does  not  contribute  to  the  polaronic  effect, 
it  is  the  effective  dielectric  constant  e*: 


1/e*  =  1/eoo  -  l/es, 


(3) 


which  affects  the  coupling  strength  [2-4,  24-26].  In  the  adiabatic  approximation,  the 
normalized  wavefunctions  i//-(r)  of  a  self-localized  charge  carrier  correspond  to  the  minima 
of  the  following  polaron  energy  functional: 


=  Kv°l  -  Up°l 
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whose  global  minimum  we  will  be  seeking  for  the  ground  state  of  the  polaron.  The 
functional  (4)  has  the  form  that  is  well  known  for  the  large  strong-coupling  polarons  [5,  9] 
and  is  a  result  of  the  optimization  of  the  total  adiabatic  energy  functional  with  respect 
to  the  polarization  of  the  medium,  thereby  exhibiting  an  effective  self-interaction  of  the 
electron  [4,  9,  11].  Correspondingly,  the  [/po1  term  in  equation  (4)  is  known  to  be  ‘made  of’ 
two  parts:  —  [/po1  =  —  t/JJ01  +  Upo1,  where  C/Jj01  represents  the  magnitude  of  the  potential  energy 
of  the  electron  in  the  polarization  field  and  U^oi  represents  the  energy  required  to  create  this 
polarization  (‘deformation  energy’);  with  the  optimal  polarization,  U^oi  =  Up°l/2. 

Both  energy  functionals  (1)  and  (4)  assume  that  the  electron  and  hole  energies  are 
measured  from  the  band  edges. 
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It  is  convenient  to  factor  out  dependences  on  physically  relevant  combinations  of 
parameters  by  introducing  appropriate  units  of  energy  and  length.  We  will  choose  such  units 
based  on  combinations  for  the  exciton  Bohr  radius  and  binding  energy  (effective  Rydberg)  in 
3D: 


«o  =  2eh2 /me1,  Ryd  =  e2 /2eag. 


(5) 


For  the  exciton  problem,  equation  (1),  one  uses  e  =  €oo  in  equation  (5),  and  for  the 
polaron  problem,  equation  (4),  one  uses  e  =  e*.  We  use  superscript  indices  ‘exc’  and 
'pol'  to  distinguish  between  the  corresponding  units  (5).  With  all  the  coordinates  (x,  y  for 
the  cylinder)  measured  in  appropriate  a o,  one  arrives  at  dimensionless  energy  functionals, 

£-exc  =  £excyRydexc  and  £P°1  =  .fpol/RydP01,  where 


and 
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The  dimensionless  £>o(r)  in  equations  (6)  and  (7)  features  the  same  behaviour  as  equation  (2) 
but  with  R  replaced  by  the  corresponding  Rg  —  R/ao. 

The  global  minima  of  equations  (6)  and  (7),  —  £“c  and  —  £q01  respectively,  would 
determine  the  dimensionless  binding  energies.  As  the  units  (5)  already  establish  the  scaling 
rules,  in  what  follows  we  will  be  comparing  Egxc  and  £g01  at  the  same  values  of  Rg.  The  ratio 
Zsg0l/£'oxc  would  have  a  direct  physical  meaning  of  the  maximum  achievable  when  es  e_x 
and  e*  —  €oo  in  equation  (3).  As  we  mentioned  earlier,  this  can  be  a  typical  situation  for  many 
polar  solvents. 

Before  proceeding  with  the  analysis  for  a  cylindrical  surface,  we  recall  in  more  detail  the 
benchmarks  known  for  isotropic  <7-dirnensional  systems  (Aj(r)  =  |r|  =  r)  briefly  described  in 
the  introduction.  The  exact  isotropic  excitonic  ground  state  ^/(r)  corresponding  to  equation  (6) 
is  given  by  the  solution  of  the  Schrodinger  equation, 


-£ox>  = 
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yielding  well-known 
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With  our  choice  of  units,  £“c  =  1  in  3D. 

The  polaronic  ground-sate  wavefunctions  and  energies  corresponding  to  equation  (7)  are 
known  from  variational  calculations  for  ^-dimensional  systems.  It  is  customary  in  the  polaronic 
literature  to  express  energies  in  terms  of  the  coupling  constant  ac  =  (me4/2e*2/t3oj) 1,2  and 
phonon  frequency  w.  Note  that  the  combination  a 2  hat  appearing  in  the  results  for  strong¬ 
coupling  (adiabatic)  polarons  corresponds  to  2Rydpo1,  as  defined  in  equation  (5).  Thus  the 
well-known  Pekar’s  result  [10]  for  the  3D  polaron  translates  into  Fpo1  ~  0.218  and  into  the 
same  magnitude  of  the  ratio  Eq°'/Eqxc  in  3D. 

The  2D  polaron  has  been  studied  in  great  detail  in  [6],  with  the  best  result  of  Eg01  ~  0.809 
for  the  adiabatic  case  being  achieved  with  Pekar-type  trial  wavefunctions.  Since  Eqxc  =  4 
in  2D,  the  ratio  E^’1  / E^xc  ~  0.202,  which  is  only  slightly  smaller  than  in  3D.  The  2D  case 
represents  the  limit  R  00  for  a  cylindrical  surface  and  is  particularly  important  for  our 
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Figure  1.  The  spatial  distribution  of  unnormalized  density  (r)|2  for  the  exciton  (dashed  line)  and 
the  polaron  (solid  line)  in  2D.  The  polaron  wavefunction  is  as  per  equation  (9).  The  inset  shows  the 
corresponding  spatial  distribution  U (r)  of  the  potential  energy  in  U  =  J  U{r)  dr. 


Figure  2.  (a)  The  dimensionless  exciton  and  polaron  binding  energies  as  functions  of  the 

dimensionless  inverse  cylinder  radius  l/Ro-  The  inset  shows  the  behaviour  of  the  polaron  binding 
energy  in  more  detail  at  small  \/Ro-  (b)  The  ratio  of  the  binding  energies  Eq°1/Eqxc  of  the 
excitations  on  a  cylinder  is  shown  as  a  thick  solid  line.  A  thin  solid  line  displays  this  ratio  as 
is  obtained  in  the  quasi- ID  calculation  with  the  effective  tube  potential  (15).  A  thin  short-dash 
line  shows  the  quasi- ID  result  for  this  ratio  if  the  effective  interaction,  instead,  was  that  of  a 
quantum  wire,  equation  (16).  A  thick  long-dash  line  displays  the  dimensionless  activation  energy 
pexc  _  ?  zrPo1 


analysis.  We  have  looked  at  simpler  one-parametric  trial  wavefunctions  that  would  make  a 
good  representation  of  the  2D  adiabatic  polaron  and  found  that  the  wavefunction 


\j/{r)  oc 


1 

cosh  (ar) 


(9) 


with  a  —  1.674  yields  a  very  good  optimization  for  the  energy:  £g01  ~  0.804,  which  is 
quite  accurate  for  our  purposes.  Figure  1  compares  the  spatial  structure  of  the  2D  exciton  and 
polaron. 

As  both  exciton  and  polaron  binding  energies  diverge  in  pure  ID,  the  following  comparison 
might  be  of  a  dubious  nature  but  is  still  interesting.  Specifically,  reference  [7]  discussed  the 
calculation  of  the  ID  polaron  in  terms  of  the  renormalized  coupling  constant  ac'  =  a.J (d  —  1), 
where  d  — >  1.  The  best  variational  result  achieved  for  the  adiabatic  polaron  was  Zs£o1  ~ 
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0.333  (a^rhco).  As  the  diverging  //-dependence  in  this  expression  is  the  same  (d  —  1)  2  as 
in  the  exciton  case  (8),  the  ratio  of  the  binding  energies  in  ID  could  then  be  interpreted  as 


E^'/E™  ~  0.167. 


3.  Variational  analysis  for  a  cylinder 


When  on  a  cylindrical  surface,  both  exciton  and  polaron  ground  states  need  to  be  determined 
numerically.  Our  variational  analysis  of  the  energy  functionals  (6)  and  (7)  has  been  performed 
on  the  following  classes  of  the  trial  wavefunctions3.  For  the  exciton  problem  we  have  used 
three-parametric  (a,  and  y  )  wavefunctions 

i AO,  y)  oc  exP  [-(“2*2  +  P2y2i  +  r)1/2]  •  (i0) 

The  polaron  problem  is  much  more  demanding  on  computation  time,  so  we  chose  two- 
parametric  (a  and  ft)  wavefunctions 


i js(x,  y)  oc 


1 

cosh  (a2x2  +  fi2y\) 1^ 


(11) 


The  functional  dependences  in  equations  (10)  and  (11)  are  such  that  they  can  recover,  in  the 
limit  of  Rq  — >  oo,  wavefunctions  (8)  and  (9)  found  for  the  2D  systems — similarly  to  the  earlier 
exciton  calculations  [12,  13]. 

We  explored  two  choices  for  the  effective  coordinate  yi  in  equations  (10)  and  (11):  ‘arc- 
based’  (as  in  [13]) 


yi  =  y,  (12) 

— 7T  Rq  <  y  <  it  Rq,  and  ‘chord-based’  (as  in  [12]) 

yi  =  2Rq  sin  (y/2R0) .  (13) 

Both  choices  can  be  thought  of  as,  respectively,  n  1  and  n  =  1  limits  of  more  general 


yi  =  7tR0 


IT 


y 


TXRq 


n  -|  ^  1/m 


which  could  be  used  in  future  refinements  as  being  more  flexible  in  terms  of  the  shape  of  the 
wavefunction  being  periodic  in  the  circumferential  direction.  In  this  paper  we  resorted  to  just 
choosing  the  best  results  among  those  obtained  with  equations  (12)  and  (13). 

Wavefunctions  (10)  and  (11)  feature  two  parameters  a  and  /l  having  the  meaning  of 
inverse  lengths,  thereby  explicitly  allowing  for  anisotropy  of  the  wavefunction  extent  in  the 
axial  and  circumferential  directions  [13].  What  we  will  later  be  referring  to  as  quasi- ID  results 
corresponds  to  =  0  when  the  wavefunctions  are  uniform  around  the  cylinder  circumference. 

The  main  quantitative  results  of  this  paper  are  displayed  in  figure  2,  showing  the  optimized 
variational  outputs  for  the  exciton  and  polaron  binding  energies  as  well  as  their  ratio  Eg0l/EgXC. 
Our  exciton  data  is  very  close  to  the  results  of  [12]  and  [13],  where  the  reader  can  find  extensive 
discussions. 

Just  as  in  the  exciton  case,  the  polaron  binding  energy  exhibits  very  little  change  from  its 
2D  value  (~0.8)  due  to  the  curvature  of  up  to  R o  ~  1  (see  the  inset  in  figure  2(a))  where  it 
starts  rising,  relatively  earlier  and  more  rapidly  than  the  exciton  dependence.  This  immediately 
translates  into  a  substantial  increase  in  the  ratio  Eg0l/Egxc  with  decreasing  Rq,  as  shown  in 
figure  2(b).  The  maximum  of  the  ratio  ~0.35  is  achieved  in  the  region  of  1  /  R{,  ~  10,  after 


3  All  numerical  integrations  and  optimizations  have  been  performed  by  using  IMSL  numerical  libraries,  as  provided 
with  the  PV-WAVE  Advantage  package,  http://www.vni.com. 
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Figure  3.  The  behaviour  of  the  functional  (7)  at  Rq  =  0.65  with  the  trial  wavefunctions  (11)  as  a 
function  of  the  variational  parameters  a  and  ft  for  two  choices  of  the  effective  coordinate  yi :  (a)  as 
in  equation  (12)  and  (b)  as  in  equation  (13). 


which  the  ratio  starts  slowly  decreasing  with  1/  Rq-  While,  of  course,  larger  ratios  E^o1/EqXC 
lead  to  larger  relative  reductions  in  the  effective  activation  energy  Eqxc  —  2 Eq0>,  it  is  quite 
interesting  that  the  absolute  value  of  this  activation  energy,  also  shown  in  figure  2(b),  exhibits 
a  non-monotonic  dependence  on  \/  Rq.  A  region  around  the  minimum  of  this  curve  indicates 
specific  tube  sizes  where  the  activation  energy  would  be  at  its  lowest. 

Our  variational  results  have  shown  that  the  the  dependence  of  Eq° 1  on  1  /  Rq  collapses  onto 
the  corresponding  quasi- ID  curve  practically  right  away  after  the  onset  of  a  substantial  rise  in 
binding.  This  is  different  from  the  exciton  case,  where  deviations  from  the  quasi- ID  behaviour 
persist  all  the  way  into  the  region  of  \/  Rq  >  20.  In  other  words,  the  polaron  spreads  uniformly 
around  the  cylinder  at  much  smaller  curvatures  than  the  exciton  does.  We  note  that  the  quasi-ID 
variational  results  for  the  polaron  binding  using  trial  wavefunctions  (11)  and  (10)  differ  very 
little  and  are  in  good  agreement  with  our  analysis  in  [11],  where  no  assumptions  have  been 
made  about  the  wavefunction  shape  and  the  nonlinear  optimizing  equation  has  been  solved 
numerically.  The  thin  solid  line  in  figure  2(b)  shows  the  ratio  £g0l/£“c  using  quasi- ID  results 
for  the  binding  energies,  and  its  deviation  from  the  variational  result  for  a  cylinder  is  entirely 
due  to  underestimation  of  the  exciton  binding. 

As  with  all  variational  calculations,  we,  of  course,  cannot  exclude  that  some  details  in 
the  results  shown  in  figure  2  may  undergo  slight  modifications  upon  further  improvements  of 
variational  wavefunctions.  Importantly,  possible  improvements  would  be  inconsequential  for 
our  main  observations  of  a  non-monotonic  dependence  of  the  ratio  Eq°  /  £qxc  on  I  /Rq  and  of 
the  magnitude  of  the  ratio  reaching  values  well  above  the  2D  value  of  ~0.2.  We  found  that  a 
rise  in  the  ratio  above  0.3  is  obtainable  even  if  the  polaron  wavefunctions  are  not  specifically 
optimized  for  a  range  of  given  Rq  but  the  2D  optimal  values  of  a  =  ~  1 .67  are  used.  For  very 

small  tubes  with  \/Rq  <  20,  the  results  of  exact  numerical  calculations  in  [11]  complement 
the  picture. 

The  transitional  region  of  Rq  ~  1  is  likely  to  be  especially  sensitive  to  the  choice  of 
trial  wavefunctions.  So,  reference  [12]  reported  an  improvement  of  a  few  percent  for  the 
exciton  binding  energy  in  the  region  of  1  <  \/Rq  <  2.5  with  certain  trial  functions.  Similar 
improvements  could  perhaps  be  found  for  the  polaron  binding  energies.  Figure  3  illustrates 
the  behaviour  of  the  polaron  functional  (7)  in  the  transitional  region,  at  Rq  =  0.65,  as  a 
function  of  the  variational  parameters  a  and  /l.  A  curious  feature  of  the  ‘landscape’  in 
figure  3(a)  is  a  clear  coexistence  of  two  minima,  one  corresponding  to  a  polaronic  state  that 


J.  Phys.:  Condens.  Matter  19  (2007)  156210 


Yu  N  Gartstein  et  al 


is  uniformly  distributed  (delocalized)  around  the  cylinder  circumference,  and  the  other  where 
the  circumferential  distribution  is  non-uniform.  While  this  appears  as  an  interesting  possibility, 
it  could  also  be  an  artefact  of  a  restricted  nature  of  a  specific  class  of  the  trial  wavefunctions; 
compare  visually,  for  example,  to  the  landscape  of  figure  3(b).  A  much  more  careful  study 
would  be  needed  to  explore  the  possibility  of  coexisting  polaronic  states  requiring  an  analysis  of 
actual  adiabatic  potential  surfaces.  An  example  of  an  analogous  analysis  can  be  found  in  [28], 
where  we  proved  a  coexistence  of  different  polarons  in  certain  quasi- ID  systems. 


4.  Discussion 

Both  in  this  paper  and  in  [11]  we  have  shown  that  the  strong  polaronic  effect  occurring  in 
sluggish  polar  environments  substantially  affects  the  relative  energetics  of  closely  bound  and 
well- separated  electron-hole  pairs  on  a  cylindrical  surface  of  nanotubes.  The  binding  energy 
£po1  of  an  individual  polaron  can,  in  an  idealized  model,  reach  as  much  as  about  0.35  of  the 
binding  energy  £bxc  of  the  exciton.  This  would  translate  into  a  reduction  in  the  activation  energy 
Zs“c  —  2 £bo1  for  exciton  dissociation  by  a  factor  of  about  three  from  the  value  £bxc  that  it  would 
have  in  a  non-polar  environment  with  the  same  value  of  the  high-frequency  dielectric  constant 
6oo-  Note  that,  in  our  earlier  study  [11],  we  have  not  found  any  additional  energy  barriers 
between  the  exciton  and  distant  polaron-pair  states.  One  should  expect  that  enhanced  separation 
of  charges  and  a  corresponding  luminescence  quenching  would  then  result,  for  example,  in 
experiments  probing  the  effects  of  electric  field  on  the  luminescence.  It  is  needless  to  say  that 
additional  factors  omitted  in  our  simplified  analysis  here  can  make  the  reduction  magnitude 
smaller  (see,  e.g.,  a  comparison  in  [1 1]  of  cases  with  different  ‘electrostatic  conditions’). 

Polar  liquid  environments  such  as  many  common  solvents  may  be  good  candidates  for 
providing  conditions  necessary  for  the  strong  polaronic  effect,  as  they  can  exhibit  both  high 
values  of  the  static  dielectric  constant  es  and  a  relatively  slow  response  of  the  orientational 
polarization  (longitudinal  relaxation  times  can  be  on  the  order  of  1  ps  and  longer)  [25,  11]. 
Quite  fittingly,  there  is  an  ongoing  intense  research  effort  on  various  applications  of  nanotubes 
in  contact  with  such  environments,  and  we  hope  that  direct  experimental  verifications  of 
our  qualitative  conclusions  will  be  possible.  We  note  that  a  comprehensive  mapping  of 
luminescence  versus  absorption  spectra  of  individual  SWCNTs  was  in  fact  achieved  in  aqueous 
suspensions  [29,  30].  Interestingly,  numerical  estimates  in  [13]  indicate  that  the  dimensionless 
radius  Rq  for  a  range  of  SWCNTs  may  be  close  to  0.1,  which  corresponds  to  the  region  of 
maximum  E^°x /E^c  ratios  in  figure  2(b). 

We  have  demonstrated  that  the  ratio  of  the  binding  energies  /i,1’"1/ £“c  has  a  non-monotonic 
dependence  on  the  cylinder  curvature  1  /R.  As  argued  in  the  introduction,  our  particular 
observation  for  the  cylindrical  geometry  can  be  a  manifestation  of  a  more  general  crossover 
effect  that  would  be  common  for  other  structures  in  3D  polar  media  when  the  increasing 
confinement  of  the  electron  motion  causes  a  ‘transition’  between  d -dimensional  systems,  such 
as  between  3D  and  2D  (quantum  wells)  or  between  3D  and  ID  (quantum  wires).  Basically,  the 
origin  of  this  effect  can  be  related  to  the  fact  that  the  spatial  extent  of  an  unconfined  polaron  is 
larger  than  the  size  of  an  unconfined  exciton  (see  figure  1  for  the  2D  case),  thereby  making  the 
polaron  ‘respond’  to  the  initially  increasing  confinement  in  a  more  pronounced  way  than  the 
exciton.  Only  after  the  exciton  experiences  a  fuller  effect  of  the  confinement,  the  ratio  of  the 
binding  energies  starts  decreasing.  Elaborating  more  on  this  idea,  the  inset  in  figure  1  shows 
the  spatial  distribution  of  the  potential  energy  terms  U:  £  —  K  —  U,  for  unconfined  excitations. 
It  is  evident  that  the  ‘longer-range’  contributions  to  the  polaron  potential  energy  are  relatively 
more  important  than  for  the  exciton.  This  is  why  the  effect  of  increasing  confinement  on  the 
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Figure  4.  Distance  dependence  of  the  effective  1 D  Coulomb  potentials  for  charges  on  a  cylinder, 
equation  (15),  solid  line,  and  in  a  quantum  wire,  equation  (16),  long-dash  line,  in  comparison  with 
the  original  Coulomb  interaction  (14),  short-dash  line. 


polaron  is  initially  stronger.  In  the  particular  case  of  the  cylindrical  geometry,  the  curvature 
changes  remote  physical  distances  (chords  instead  of  arcs,  equation  (2))  more  than  it  does  close 
distances. 

A  meaningful  parallel  can  be  drawn  with  the  behaviour  of  the  ratio  U /  K  of  the  potential 
and  kinetic  energy  terms  in  confined  systems.  The  virial  theorem  for  the  Coulombically  bound 
states  in  unconfined  t/-dimensional  systems  states  that  U/K  =  2,  independent  of  d,  which, 
of  course,  also  follows  directly  from  scaling  of  both  functionals  (1)  and  (4)  provided  that 
D(r)  =  | r |  =  r.  In  particular,  polarons  in  such  systems  are  known  [31,  9,  4]  to  satisfy  the 
following  ratios  for  various  energy  terms:  El°l:Kpol\Up°l:U^°l\U^o]  =  1:1:2:4:2.  Some  of 
these  relationships  are  violated  in  confined  systems.  As  studied  in  [32],  the  virial  theorem  ratio 
Ucxc  j  Kcxc  for  excitons  in  quantum  wells  and  quantum  wires  is  larger  than  2  and,  in  fact,  a  non¬ 
monotonic  dependence  of  f/exc/A'exc  has  been  demonstrated  for  quantum  wells  transitioning 
between  3D  and  2D  limits.  We  have  found  a  non-monotonic  behaviour  of  the  ratio  U /K  for 
both  polarons  and  excitons  on  a  cylinder  as  a  function  of  the  curvature  1  / R.  In  agreement  with 
our  qualitative  arguments,  at  smaller  1  /  R  this  ratio  grows  much  more  quickly  for  the  polaron 
than  for  the  exciton.  At  large  curvatures,  however,  the  trend  is  reversed  and  the  exciton  has 
larger  ratios  U / K  than  the  polaron. 

It  is  useful  to  continue  a  qualitative  reasoning  by  discussing  the  quasi- ID  limit  of  our 
results,  that  is,  the  case  of  stronger  but  still  finite  degrees  of  confinement.  One  can  then  use  the 
notion  of  the  effective  Coulomb  potentials  [1,  32],  here  as  a  function  of  the  ID  (axial)  distance 
x.  (The  effective  Coulomb  potentials  for  2D  can  be  similarly  introduced  [32]).  Figure  4 
compares  three  potentials  in  units  such  that,  at  very  large  distances,  the  potentials  behave  as 


Veff(x)  =  Ro/x. 


(14) 


In  this  limit  the  electron  wavefunction  on  a  cylinder  is  delocalized  around  its  circumference 
and  the  effective  interaction  becomes  that  of  rings  of  charge  given  by 


Veff(x)  = 


2 

tt[(x/Rq)2  +  4]1/2 


K 


4 

(x/R0)2  +  4 


(15) 


where  K (m)  —  (1  —  m  sin2  0)  V2  d 0  is  a  complete  elliptic  integral  of  the  first  kind.  If, 

instead  of  a  tube,  we  dealt  with  a  quantum  wire,  the  electron  wavefunction  would  be  delocalized 
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throughout  the  cross-section  of  the  wire,  with  the  effective  interaction  approximated  as  [1] 

Vrff(x)  =  r  /  g  !  -l  n  V  (16) 

( x/R0 )  +  0.3 

Both  effective  tube  (15)  and  wire  (16)  ID  potentials  feature  modifications  of  the  shorter- 
range  interaction  from  the  original  Coulomb  (14)  due  to  the  transverse  spread  of  wavefunctions, 
thereby  eliminating  a  pure  ID  divergence  of  the  ground  states  for  excitons  and  polarons.  As  we 
discussed  above,  the  role  of  the  longer-distance  interactions  is  more  important  for  the  polaron 
than  for  the  exciton.  Since  the  relative  modification  of  the  original  Coulomb  to  the  effective 
potentials  is  increasing  towards  shorter  distances,  it  is  then  clear  that  the  ratio  E^ol/E^xc  in 
systems  with  modified  interactions  should  be  larger  than  values  of  about  0.2  in  systems  with 
pure  Coulomb  interaction.  Moreover,  following  the  same  logic,  one  should  expect  that  the 
larger  modification  is  from  the  original  Coulomb  distance  dependence,  the  larger  will  be  the 
E^°l/E^xc  ratios.  Figure  4  shows  that  deviations  from  the  Coulomb  dependence  for  the  effective 
wire  potential  are  smaller  than  for  the  tube  potential  (over  a  relevant  spatial  range).  We  have 
performed  a  quasi- ID  variational  optimization  of  the  polaron  and  exciton  binding  energies  with 
the  wire  potential  (16).  The  resulting  ratios,  Eq°1/EqXC,  are  shown  in  figure  2(b)  with  a  short- 
dash  thin  line  and  are,  indeed,  smaller  than  the  ratios  calculated  with  the  tube  potential  (15) — 
solid  thin  line  in  that  figure. 

These  qualitative  arguments,  while  conhrmed  by  specific  model  calculations,  do  not  appear 
to  be  restricted  to  this  specific  situation.  We  therefore  expect  that  findings  of  larger  magnitudes 
of  the  ratio  E^ol/E^xc  and  of  a  non-monotonic  dependence  of  this  ratio  on  the  degree  of 
confinement  represent  a  crossover  effect  that  can  be  common  to  semiconductor  nanostructures 
in  3D  polar  environments.  Further  calculations  with  different  geometric  structures  are  needed 
to  validate  this  conjecture  and  evaluate  its  quantitative  aspects,  including  in  non-uniform  polar 
environments. 

We  would  like  to  reiterate  that  numerical  data  that  we  arrived  at  in  this  paper  are  based 
on  a  simplified  model;  more  accurate  calculations  would  have  to  take  into  account  specific 
aspects  of  the  nanotube  band  structure  and  the  tube’s  own  polarizability.  Optical  studies  of 
SWCNTs  particularly  indicate  the  dependence  of  the  exciton  binding  not  only  on  R  but  also 
on  the  chirality  of  the  tubes;  see  the  discussion  and  multiple  citations  in  [33].  The  latter 
reference  also  shows  separately  the  effect  on  excitons  of  tube’s  polarizability;  our  earlier  quasi- 
ID  calculations  are  consistent  with  their  findings,  and  we  refer  the  reader  to  [1 1]  for  illustrations 
of  this  effect  on  polarons. 

Another  interesting  subject  for  future  research  is  an  assessment  of  the  activation  energy 
for  exciton  dissociation  in  confined  semiconductors  with  unequal  electron  and  hole  masses, 
me  ^  mt].  In  this  case,  electron-polarons  and  hole-polarons  have  different  binding  energies 
and,  in  addition,  the  exciton  itself  can  cause  an  adiabatic  polarization  of  the  environment  [8, 27]. 
The  corresponding  ‘polaronic’  corrections  to  the  exciton  binding  in  quantum-well  wires  have 
been  studied,  for  example,  in  [34],  From  a  general  standpoint,  one  would  also  like  to  extend 
the  analysis  of  the  effects  of  polar  media  on  the  dissociation  of  confined  excitons  to  the 
intermediate-coupling  [2-4]  regime. 
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We  discuss  coherent  exciton-polariton  states  in  long  molecular  chains  that  are  formed  due  to  the  interaction 
of  molecular  excitations  with  both  vacuum  photons  and  surface  excitations  of  the  neighboring  reflecting 
substrate.  The  resonance  coupling  with  surface  plasmons  (or  surface  polaritons)  of  the  substrate  can  substan¬ 
tially  contribute  to  the  retarded  intermolecular  interactions  leading  to  an  efficient  channel  of  the  decay  of 
one-dimensional  excitons  with  small  momenta  via  emission  of  surface  excitations.  The  interface  also  modifies 
the  radiative  decay  of  excitons  into  vacuum  photons.  In  an  idealized  system,  excitons  with  higher  momenta 
would  not  emit  photons  or  surface  waves.  For  a  dissipative  substrate,  additional  exciton  quenching  takes  place 
owing  to  Joule  losses  as  the  electric  field  of  the  exciton  polarization  penetrates  the  substrate.  We  discuss  how 
these  effects  depend  on  the  polarization  of  molecular  excitations,  their  frequency,  and  the  distance  of  the  chain 
from  the  substrate. 
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I.  INTRODUCTION 

The  interaction  of  the  electromagnetic  field  with  a  mo¬ 
lecular  excitation  in  an  aggregate  of  identical  molecules 
leads,  on  the  one  hand,  to  the  delocalization  of  the  excitation 
over  the  aggregate  and,  on  the  other  hand,  to  a  modification 
of  its  radiative  decay.1  Both  notions  of  excitons  and  polari¬ 
tons  are  used  in  the  literature  on  such  delocalized  excitations. 
In  crystalline  structures,  the  excitation  can  be  spatially  co¬ 
herent  and  then  it  is  characterized  by  its  wave  vector  as  a 
proper  quantum  number.  In  this  paper  we  discuss  how 
electric-dipole-active  coherent  excitations  in  linear  crystals 
are  affected  by  the  presence  of  a  neighboring  metallic  or 
dielectric  half-space. 

A  variety  of  one-dimensional  (ID)  electronic  systems 
available  nowadays,  such  as  conjugated  polymers,  J  and  H 
aggregates,  semiconducting  quantum  wires,  and  carbon 
nanotubes,  exhibit  interesting  optical  properties  and  are  con¬ 
sidered  for  potential  applications  in  optoelectronics;  their 
spectroscopy  is  an  active  research  area.  Successes  at  the  syn¬ 
thesis  and  fabrication  of  these  systems  have  resulted  in  a 
continuously  improving  quality  and  an  increase  of  their 
“conjugation  length”  L  which  may  exceed  the  appropriate 
electromagnetic  wave  length  X.  Perhaps  one  of  the  most 
noteworthy  achievements  in  this  regard  is  the  recent 
observation2  of  a  macroscopic  coherence  of  a  single  exciton 
in  poly  diacetylene  chains  of  L—  10  /a  m  that  allowed  one  to 
discuss  the  issue  of  an  “ideal  ID  quantum  wire.”3  It  is  also 
relevant  to  note  a  physically  very  different  but  conceptually 
related  class  of  excitations  in  chains  of  “dipole-coupled” 
nanoparticles  (see,  e.g.,  the  recent  work  in  Ref.  4  and  mul¬ 
tiple  references  therein)  studied  for  photonic  and  plasmonic 
applications. 

Long  before  the  modern  experimental  advances,  it  was 
shown1,5  that  the  coherent  interaction  of  low-dimensional 
(ID  and  2D)  excitons  with  an  electromagnetic  field  is  dras¬ 
tically  different  than  in  3D  systems.  The  radiative  decay  of 
low-dimensional  excitons  is  strongly  enhanced  in  the  region 


of  their  wave  vectors  |q|  sS  k,  where  k=a>/c  (c  is  the  speed  of 
light  in  vacuum)  is  determined  self-consistently  by  the  exci¬ 
tation  energy  E(q)  =  ftu>,  while  the  excitations  with  |q|>k 
would  not  radiate,  as  is  required  by  energy  and  momentum 
conservation  for  an  exciton-photon  system.  More  specifi¬ 
cally,  for  ID  excitons  in  the  molecular  chain  in  vacuum,  the 
radiative  width  T  =  fil  t  (t  being  the  decay  time)  depends  on 
the  wave  vector  q  as 

Tv(k,g)  =  — —  [2(k2  -  q2)cos2  0+(k2  +  q2)  sin2  #]©(k- |g|), 
a 

(1) 

where  p  is  the  magnitude  of  the  molecular  dipole  transition 
moment  p  that  makes  an  angle  0  with  the  chain  axis,  a  the 
intermolecular  spacing  along  the  chain,  and  ©  the  step  func¬ 
tion.  As  compared  with  the  radiative  width 

T°(k)  =  \p2k2n>  (2) 

of  a  single  molecule,  Eq.  (1)  exhibits  an  enhancement  factor 
of  ~\/a  =  2ir/ka~  102-103  in  the  optical  region  of  the  spec¬ 
trum  (“superradiant”  states).  This  enhancement  has  been  dis¬ 
cussed  in  the  context  of  various  systems  (see,  e.g.,  Refs.  6-9 
and  references  therein).  Equation  (1)  also  shows  how  the 
polarization  of  the  transition  dipole  affects  the  q  dependence 
of  the  decay  rate. 

Importantly  for  applications,  it  is  possible  to  manipulate 
the  optical  properties  of  molecular  excitations  and  to  form 
new  hybrid  excitations  by  putting  molecules  or  molecular 
aggregates  in  the  vicinity  of  interfaces  and  in  dielectric 
microcavities.10  Well  known,  e.g.,  is  an  oscillating  depen¬ 
dence  of  the  radiative  width  on  the  distance  of  a  single  mol¬ 
ecule  from  the  planar  interface  resulting  from  the  interfer¬ 
ence  of  the  radiative  fields  of  a  molecular  transition  and  its 
image. 10,1 1  The  dipole-dipole  interaction  also  gets  modified 
in  the  vicinity  of  a  surface  or  in  the  cavity.12-14  In  the  case  of 
a  linear  molecular  crystal,  the  environment  can  lead  to  quali¬ 
tatively  interesting  coherent  effects  as  it  is  the  interaction  of 
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FIG.  1.  Decay  width  r(k,g)  of  ID  exciton  polaritons  as  a  function  of  the  reduced  wave  vector  qlk  for  a  fixed  value  of  &=2-7r/400a  and 
substrate’s  dielectric  constant  e=-10.  Two  rows  correspond  to  different  dipole-to-image-dipole  distances  d:  upper  panels  (a)-(c)  are  for 
d=600a,  lower  panels  (d)-(f)  for  d=10a.  All  distances  are  measured  in  units  of  intermolecular  spacing  in  the  chain,  a.  Each  of  the  trio  of 
panels  in  a  row  corresponds  to  different  exciton  polarization,  indicated  by  the  boldface  letters.  The  chain  is  parallel  to  the  x  axis  and  situated 
above  the  substrate  whose  surface  is  the  xy  plane.  Decay  width  is  shown  with  respect  to  Tv  =  2irp1k2 / a  which  is  the  radiative  width  of  the 
x-polarized  exciton  in  vacuum  at  q  =  0,  Eq.  (2).  The  overall  vacuum  benchmark  results  (1)  are  shown  with  the  dash-dotted  lines.  The  total 
decay  width  is  displayed  by  thick  solid  lines;  the  dashed  solid  lines  (when  distinguishable)  show  the  part  of  the  width  due  to  the  decay  into 
vacuum  photons  only.  The  vertical  dash  lines  indicate  the  position  of  the  surface  plasmon  wave  vector,  Kp/k ,  Eq.  (4). 


many  molecular  transition  dipoles  (and  their  images)  that 
would  determine  the  properties  of  the  excitation.  Philpott,6 
for  instance,  pointed  out  that  by  placing  a  linear  chain  near  a 
transparent  substrate,  one  could  probe  some  of  the  otherwise 
nonradiant  polariton  states  via  emission  of  bulk  substrate 
photons.  It  was  also  studied  how  the  radiative  decay  proper¬ 
ties  of  quantum  wire  excitons  get  modified  when  the  wire  is 
embedded  in  a  microcavity — that  is,  via  emission  of  cavity 
photons  (Refs.  8  and  15  and  references  quoted  there).  As 
one-dimensional  arrays  are  considered  for  directed  energy 
transfer  applications,  their  interaction  with  the  environment 
may  also  be  used  to  achieve  certain  purposes  such  as,  e.g.,  to 
counteract  losses  by  embedding  the  array  in  the  gain 
medium.16 

In  this  paper  we  discuss  ID  coherent  dipole  excitations 
that  are  formed  in  the  neighborhood  of  the  planar  reflecting 
substrate  in  the  range  of  frequencies  &>  where  the  dielectric 
constant  of  the  substrate  medium  e(w)<0  and  the  substrate 
does  not  support  bulk  photon  modes.  This  is  the  situation 
that  is  most  easily  implementable  in  the  vicinity  of  a  metallic 
surface  and  which,  in  fact,  has  recently  received  a  consider¬ 
able  attention  in  the  context  of  both  organic  excitons17  and 
dipole  excitations  of  nanoparticles  (Ref.  18  and  references 
therein).  The  substrate  in  general  affects  both  the  dispersion 
of  the  excitons  and  their  lifetime.  We  will  show  that  the 
presence  of  the  substrate  may  result  in  new  interference  pat¬ 


terns  and  leads  to  a  plethora  of  behaviors  depending  on  the 
polarization  and  frequency  of  the  excitation  as  well  as  on  the 
distance  from  the  interface.  For  the  decay  width,  this  is  illus¬ 
trated  in  Fig.  1  that  exemplifies  substantial  differences  with 
the  vacuum  result  (1)  and  is  discussed  in  more  detail  later. 
This  figure  demonstrates  not  only  a  modification  of  the  ex¬ 
citon  decay  into  vacuum  photons,  but  also  the  decay  into 
substrate  surface  plasmon  (SP)  modes  [which  occur  for 
e(cj)  <-l],  the  significance  of  the  latter  channel  strongly  in¬ 
creasing  upon  approach  to  the  interface. 

The  importance  of  the  coupling  of  an  individual  dipole 
excitation  to  substrate  SP  modes  and  of  associated  resonance 
decay  and  scattering  phenomena  have  already  been  stressed 
for  both  molecules 19-21  and  nanoparticles.18  This  coupling 
strongly  increases  as  the  exciton  transition  frequency  ap¬ 
proaches  the  “resonance”  region  of  the  substrate  in  which 
e{u>)  is  close  to  -1.  The  increase  is,  of  course,  what  should 
be  expected  from  the  theory  of  electrostatic  image  forces22 
which  features  the  combination  factor 

e0=(i-6)/(i  +  6)  o) 

for  the  magnitude  of  image  charges.  One  should  be  aware, 
however,  that  this  is  also  the  region  where  both 
retardation19-20  and  dissipation11  effects  are  particularly  im¬ 
portant.  The  corresponding  enhancement  of  the  decay  of  ID 
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FIG.  2.  As  in  Fig.  1  but  for  d- 50a  and  e=-1.2.  Note  that  despite  the  distance  here  being  5  times  larger  than  in  panels  (d)-(f)  of  Fig.  1; 
the  decay  into  substrate  surface  plasmons  is  much  stronger. 


excitons  into  SPs  is  seen  in  the  illustration  of  Fig.  2  (e  = 
-1.2)  where,  in  comparison  with  Fig.  1  (e=-10),  it  clearly  is 
a  dominant  decay  channel;  consequently,  the  fluorescence 
efficiency  is  greatly  reduced.20  One  also  appreciates  the  fact 
that  the  resonant  enhancement,  as  shown,  occurs  over  the 
already  “superradiant”  vacuum  decay  rate. 

Both  Figs.  1  and  2  have  been  calculated  with  negligible 
exciton  scattering  and  substrate  losses  (see  Sec.  IV  for  a 
discussion  of  dissipation  effects);  hence,  they  are  reflective 
of  the  full  conservation  laws  for  our  exciton-photon-SP  sys¬ 
tem.  So  excitons  with  wave  vectors  |q|  >  k  cannot  decay  into 
vacuum  photons,  while  emission  of  SPs  can  occur  only  for 
|g|</cp(£)  where 

Kv  =  k[e/(e+ 1)]1/2  (4) 

is  the  well-known  (e.g..  Ref.  10)  wave  vector  of  the  SP  at  an 
appropriate  frequency.  In  such  an  idealized  system,  exciton 
polaritons  with  \q\  >  Kp(k)  would  be  nonemissive.  An  ex¬ 
ample  of  the  corresponding  qualitative  picture  of  the  disper¬ 
sion  spectrum  of  exciton  polaritons  in  the  chain  near  the 
substrate  is  shown  schematically  in  Fig.  3.  As  the  radiative 
decay  rate  vanishes  at  |g|  =k-0  (see  Figs.  1  and  2),  no  branch 
splitting  takes  place  upon  the  polariton  spectrum  crossing  the 
vacuum  photon  line.  The  branching  instead  occurs  in  the 
region  of  would-be  crossing  the  SP  dispersion  curve  [this 
splitting,  however,  is  absent  for  y-polarized  excitons  whose 
decay  rate  into  SPs  vanishes  at  |g|  =  Kp(A:)-0].  The  nonemis¬ 
sive  branch  2  in  the  figure  can  be  thought  of  as  of  excitations 
representing  a  coherent  mix  of  the  exciton,  photons  and  SPs 
of  the  same  momentum  projection  along  the  chain,  the  rela¬ 
tive  weight  contributions  depending  on  this  momentum.  We 
cannot  exclude  that  surface  plasmon  guiding  by  chains  of 
nanoparticles  found  in  recent  numerical  simulations23  is  re¬ 
lated  to  the  formation  of  the  discussed  bound  exciton-SP 
states. 

As  the  SPs  are  surface  states  characterized  by  2D  wave 
vectors,  the  inverse  square-root  singular  behavior  in  the  de¬ 
cay  rates  of  ID  excitons  upon  >/<p  seen  in  Figs.  1  and  2 
has  a  “dimensionality”  origin  similar  to  the  one  taking  place 
for  2D  (quantum  well)  excitons  decaying  into  3D  vacuum 
photons 1,5,24,25  and  ID  (quantum  wire)  excitons  decaying 
into  2D  cavity  photons.8 


In  what  follows  we  will  elaborate  on  the  interaction  of  ID 
excitons  with  the  neighboring  reflecting  substrate  using  the 
simplest  model  of  a  chain  of  molecules  with  a  single  molecu¬ 
lar  dipole  transition  (Frenkel-like  excitons).  We  note  that 
some  resulting  features  would  be  generically  valid  for 
Wannier-Mott  excitons  as  well.  We  however  do  not  pursue 
here  an  explicit  analysis  of  the  Wannier-Mott  excitons  that 
would  require  specific  modifications  for  their  bare  dispersion 
as  well  as  the  influence  of  the  substrate  on  the  exciton  bind¬ 
ing  and  oscillator  strength.  While  we  chose,  for  certainty,  to 
use  an  example  of  the  metallic  substrate  and  the  correspond¬ 
ing  surface  plasmon  excitations,  it  should  be  clear  that  the 
same  effects  would  occur  for  a  dielectric  substrate  whenever 
it  can  feature  the  region  of  negative  e(w)  and  surface  polar¬ 
iton  excitations. 


FIG.  3.  Schematically  (not  to  scale),  an  example  of  the  possible 
idealized  dispersion  of  x-polarized  ID  exciton  polaritons  in  the 
form  of  their  frequency  a >=E(q)/h  as  a  function  of  the  wave  vector 
q.  This  spectrum  is  a  result  of  the  interaction  of  the  bare  excitons 
with  vacuum  photons,  whose  spectrum  is  reflected  in  the  limiting 
long-dashed  line,  and  with  substrate  surface  plasmons,  whose  spec¬ 
trum  is  reflected  in  the  limiting  short-dashed  line  (<nsp  is  the 
asymptotic  SP  frequency).  The  exciton-polariton  spectrum  (solid 
lines)  is  split  into  two  branches.  The  states  of  branch  2  are  nonemis¬ 
sive  while  the  states  of  branch  1  can  decay  via  emission  of  photons 
and/or  SPs.  The  photon  emission  can  take  place  only  for  states  of 
branch  1  with  |g|  <k — that  is,  where  branch  1  lies  above  the  shown 
vacuum  photon  line;  exactly  at  their  crossing,  the  radiative  decay 
rate  vanishes  and  stays  zero  afterwards. 
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II.  FRENKEL  EXCITON  POLARITONS  IN  A  MOLECULAR 
CHAIN:  VACUUM  RESULTS 

Renormalization  of  a  single-molecule  transition  of  energy 
Et  and  the  formation  of  an  electric-dipole  exciton  band  in  a 
molecular  aggregate  can  be  derived  in  various  frameworks 
(see,  e.g..  Refs.  1  and  26)  with  identical  results.  Here  we  will 
use  a  simple  and  physically  transparent  description  having  a 
clear  underlying  semiclassical  analogy.  In  the  Heitler- 
London  approximation  (energy  Ef  is  much  larger  than  all 
other  energies  involved),  the  exciton  Hamiltonian  for  inter¬ 
acting  identical  molecules  can  be  written  as 

H~  (Ef+  To)  2  B],Bn  +  2  Vn_m{B\Bm  +  H.c.) 

n  n>m 

=  2  E(q)BqBq, 
q 

where  B'n  creates  an  excitation  on  the  nth  molecule  whereas 
B  [=  N~1,2^nelqnclBln  creates  an  excitation  of  the  wave  vector  q 
in  a  chain  of  N  molecules  with  the  intermolecular  spacing  a. 
Correspondingly,  the  exciton  energy 

E{q)  =  Ef  +  Uq  +  2  2  V n  cos(qna) ,  (5) 

n>  0 

where  V0  represents  a  possible  renormalization  for  a  single 
molecule,31  while  Vn  the  effective  intermolecular  interaction 
as  mediated  by  the  electromagnetic  field.  The  interaction  is 
handily  expressed  via  the  semiclassical  expression 

V„  =  -p-E(nax),  (6) 

with  p  being  the  molecular  transition  dipole  moment  and 
E(nax)  the  electric  field  produced  by  the  dipole  p=pp  at  the 
distance  na  along  the  chain  axis  chosen  to  be  along  x  (we  use 
hatted  letters  to  denote  unit  vectors). 

If  the  electric  field  E  is  the  total  retarded  field,  however,  it 
is  in  fact  also  a  function  of  the  oscillating  dipole  frequency 
co=ck  and  has  both  real  and  imaginary  parts.  Equation  (5) 
then  has  to  be  rewritten  as  a  more  general  equation 

E{q)=E' -iYI2  =  Ef+t{k,q),  (7) 

involving  the  self-energy  correction  X(k,q),  the  real-space 
transform  of  which 


~Z(k,x)  =  -  p  ■  E(£,xx)  (8) 

serves  to  replace  Eq.  (6).  Equation  (7)  self-consistently  (via 
k=E/hc )  determines  both  the  dispersion  E'(q)  =  Ej 
+  Re{2(A:,^)}  and  the  decay  width  T(g)  =  -2  Im{2(A:,<7)}  of 
the  exciton-polariton  states  as  a  function  of  their  wave  vector 
q.  This  simple  approach,  alternatively  formulated  in  terms  of 
Green’s  functions,  is  both  physically  appealing  and  powerful 
as  it  involves  only  classically  calculable  electric  fields;  vari¬ 
ous  aspects  of  it  have  been  used  for  different  geometries 
(e.g.,  Refs.  10,  11,  and  14  and  references  therein).  In  this 
paper  we  will  not  pursue  solving  specific  self-consistent 
problems  that  may  depend  on  a  multitude  of  parameters  and, 
instead,  be  discussing  the  self-energy  for  a  given  real  value 
of  parameter  k  (that  is,  the  oscillating  frequency)  for  differ¬ 
ent  values  of  the  excitation  wave  vector  q.  For  our  numerical 


illustrations  in  this  paper  we  chose  a  representative  value  of 
k  corresponding  to  the  wavelength  K  =  2tt/ k  =  400a,  a  reason¬ 
able  magnitude  for  the  optical  region  of  the  spectrum  given 
typical  spacing  a  ~  1 0  A .  Understandably,  typical  values  for 
nanoparticle  systems  would  be  different.4  With  retardation 
effects  neglected  (purely  electrostatic  fields),  the  value  of  k 
would  have  to  be  set  equal  to  zero. 

Let  us  briefly  review  the  application  to  a  molecular  chain 
in  vacuum  (see  also  Refs.  6  and  4).  Consider  the  standard22 
retarded  electric  field  at  the  point  r=rr  from  the  oscillating 
point  dipole  in  vacuum, 

Ev(£,r)  =  ^7  j^2[P  -  r(r  •  p)]  +  -  y  )[3 r(r  •  p)  -  p]| , 

(9) 


and  the  corresponding 

£v(&,  r)  =  -  p  •  Ev(£,  r) . 

This  expression  turns  out  to  be  directly  applicable  even  for 
the  decay  of  a  single  molecule  (see  also  Ref.  10):  indeed, 
calculating  -2  Im{Xv(A:,r^0)}  immediately  leads  to  the 
well-known  decay  width  (2).  To  derive  the  decay  rate  for  a 
ID  exciton  state  with  wave  vector  q ,  one  augments  this  de¬ 
cay  by  the  sum  of  contributions  from  other  molecules: 

Tv(A:,g)  =  r°(0  -  42  cos(qna)lm{Zv (k,nax)} , 

n>  o 


resulting,  after  evaluation  of  the  sum,  in  already  quoted  Eq. 
(1).  Exemplifying  a  general  feature  of  self-energy  correc¬ 
tions,  a  direct  inspection  easily  verifies  that,  for  a  fixed  k, 
Eqs.  (1)  and  (2)  satisfy,  as  expected, 

'^JT\k,q)=NT\k).  (10) 

9 


With  the  real  part  of  the  field  (9),  one  immediately  obtains 
the  effective  resonant  interaction  matrix  element: 


Re{2v(£,.r)}  =  (1  -  3  cos2  6) 
P“ 


cos(fct)  k  sin(fcr) 
5  +  ^ 


■  ( 1  -  cos2  6) 


2  A  k2  cos(fct) 


(ID 


exactly  the  same  result  that  would  be  derived  in  the  picture 
of  the  virtual  photon  exchange.26  Calculating  the  correspond¬ 
ing  sums  with  n> 0  (and  disregarding  the  irrelevant  single¬ 
molecule  renormalization  ’1)  for  many  molecules  in  the  long- 
wavelength,  |ga|<?  1,  expansion,  one  arrives  at 


4  Re{2v(M)} 

p 


(1-3  cos2  6) 


4£(3) 


+  k2  -  3  q2 


-  ( k 2  -  q2)b 


+  (1  -  cos2  6) k2b. 


(12) 


where  £(3)  —  1.202  and  b  =  \i\(\k2-q2\a2).  [It  is  useful  to  note 
that  approximation  (12)  actually  works  very  well  over  a  siz¬ 
able  portion  of  the  exciton  Brillouin  zone.]  The  known  loga¬ 
rithmic  divergence  in  Eq.  (12)  upon  \q\  -^k  signifies  the 
splitting  of  the  exciton-polariton  spectrum  into  two 
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branches5-7  as  caused  by  the  radiative  zone  component  of  the 
field  at  cos  1  in  Eq.  (11).  Of  course,  corresponding  to  this 
divergence  there  is  a  nonvanishing  decay  rate  at  | |  =  A:  —  0  in 
Eq.  (1).  The  vanishing  of  the  latter  takes  place  only  at 
cos  0 =  1 ,  and  in  this  case  the  exciton  dispersion  exhibits  a 
single-branch  behavior  with  a  cusp  at  \q\  =k. 

The  electrostatic  part  of  Eq.  (12)  features  a  nonparabolic 
behavior  <*q2  In (qa)2  at  |<;/|  >  k  due  to  the  long-range  nature 
of  the  dipole-dipole  interaction,  making  the  exciton  disper¬ 
sion  “steeper,”  the  behavior  that  recently  attracted  attention 
in  the  context  of  exchange-interaction  effects  for  excitons  in 
carbon  nanotubes.27  The  overall  width  of  the  bare  exciton 
zone  as  seen  in  Eq.  (12)  is  scaled  with  the  energetic  param¬ 
eter 


— S l(k,x)  =  f  KdKe  yd(  -^J+(kx)  + - J_(a:x)) 

P  Jo  \  2  2 y  / 

(15) 


for  y-polarized  dipoles,  and 

i  r  2 Q 

—Xl(k,x)  =  I  Kdi<e~yd - Jq(  kx)  (16) 

p~  Jo  y 

for  z-polarized  dipoles.  In  Eqs.  ( 14)— (16), 

J±(x)  =  Jo(x)  ±  h(x) 

are  composed  of  Bessel  functions  of  the  first  order  while 
parameters 


J  =  p2la\  (13) 

establishing  the  unit  for  the  nearest-neighbor  electrostatic 
dipole-dipole  interaction.  To  appreciate  the  scale  of  energies 
involved:  with  p=  1  D  and  a  =  1 0  A ,  for  instance,  J 
—  0.014  eV.  With  reasonable  variations  of  values  of  p  and  a, 
J  could  reach  magnitudes  —0.1  eV. 


III.  MOLECULAR  CHAIN  NEAR  THE  INTERFACE 

In  the  vicinity  of  the  interface  with  a  metallic  or  dielectric 
body,  the  total  electric  field  can  be  conveniently  represented 
as  a  sum  of  the  primary,  vacuum,  field,  discussed  in  Sec.  II 
and  the  secondary  field  due  to  the  induced  response  of  that 
body:  E  =  EV+ES.  Correspondingly,  the  self-energy  of  the  ex¬ 
citon  polaritons  is  also  represented  as  X  =  XV+XS.  In  what 
follows  we  discuss  the  induced  contribution  Ss(Ar ,  ^7)  coming 
from  a  half-space  characterized  by  the  dielectric  constant  e 
=  e(oj)  taken  at  the  frequency  o>=ck  in  the  geometry  of  the 
molecular  chain  (along  x),  being  parallel  to  the  separating 
interface  (xy  plane)  at  the  distance  Zt)=dl2  from  it  id  is  the 
distance  between  a  dipole  and  its  image). 

The  problem  of  an  electric  dipole  near  a  metallic  or  di¬ 
electric  half-space  is  a  classical  problem  first  treated  by 
Sommerfeld10'28  and  whose  solution  is  available  in  different 
forms.  Here  we  find  it  convenient  to  adopt  the  expressions 
for  the  electric  field  as  derived  in  Ref.  29.  In  the  context  of 
our  application  for  fields  along  the  chain,  it  matters  how  the 
dipoles  are  oriented  with  respect  to  both  the  chain  and  the 
interface.  For  a  chain  of  electric  dipoles  of  an  arbitrary  po¬ 
larization  p,  one  easily  finds  that 

2s(A :,x)  =p1$\{k,x)  +p2Xsy(k,x)  +  p2X\{k,x), 

where  axes-related  components  can  be  rewritten  from  results 
in  Ref.  29  as  follows: 

— zXsJk,x)  =  [  KdK  e~yd(  -^-J_(kx)  +  - — J+{kx ) 
p~  J0  \  2  2y 

(14) 


and 


7e~  ey  p_7e~y 

ye+ty  7e+y 


(17) 


y=  (k2  -  k2)1'2,  ye=  (k2  -  ek2)112  (18) 

[for  negative  u< 0,  « 1/2  =  -/(-«) 1/2  should  be  used  in  Eq. 
(18)].  One  can  straightforwardly  verify  that  the  no¬ 
retardation  limit  (k=0)  of  the  above  expressions  leads  to  the 
usual  electrostatic  fields  of  image  dipoles. 

Representations  (14)— (16)  are  quite  meaningful  physi¬ 
cally.  One  immediately  observes  that  the  pole  in  the  param¬ 
eter  Q  in  Eqs.  (17)  occurs  at  k  equal  to  Kp  in  Eq.  (4),  the 
wave  vector  of  the  surface  plasmon  (surface  polariton)  when¬ 
ever  real  e<-l.  Together  with  /?-po  lari  zed  photons,  SPs 
make  the  0-containing  contributions  in  Eqs.  (14)— (16).  The 
parameter  P-containing  terms,  on  the  other  hand,  correspond 
to  contributions  to  the  electric  fields  from  .v-po  lari  zed  pho¬ 
tons.  Expressions  (14)— (16)  taken  at  the  source  point  x=0 
would  describe  the  effect  of  the  half-space  on  the  electronic 
transition  in  a  single  molecule  as  studied  in  Refs.  19  and  20. 

We  now  turn  to  the  ^-dependent  self-energy  of  an  exciton 
in  a  long  chain  of  molecules.  Restricting  ourselves  to  chain- 
to-interface  distances  larger  than  the  intermolecular  spacing, 
d>  a,  one  can  safely  use  a  continuum  description  of  the  half¬ 
space  response: 


T(k,q) 


— Xs(£,x)cos(gx) . 
a 


(19) 


As  with  the  vacuum  case,  all  results  are  clearly  even  func¬ 
tions  of  q:  to  simplify  the  expressions,  we  will  therefore 
continue  assuming  q>0.  The  transformation  (19)  for  indi¬ 
vidual  dipole  contributions  ( 14)— ( 1 6)  is  facilitated  by  the 
integrals30 


dx  cos (qx)J0(Kx) 


dx  cos (qx)J2(Kx) 


1 

{K2-q2)m 


@(/r-g). 


1  -  2 q2/K2 

(*2- q2)m 


<d{K-q), 


(20) 


(21) 


for  x-polarized  dipoles, 


so  that  Eq.  (16),  e.g.,  is  transformed  into 
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FIG.  4.  Real  (upper  panels)  and  imaginary  (lower  panels)  of  the  self-energy  X(k,q)  of  exciton  polaritons  as  functions  of  the  reduced  wave 
vector  qlk  for  a  fixed  value  of  £=2ir/400fl,  e=— 1.1,  and  d=l0a.  Self-energy  is  shown  with  respect  to  J  from  Eq.  (13).  The  vertical 
short-dashed  lines  show  the  position  of  the  surface  plasmon  wave  vector,  Kp/k.  In  the  upper  panels,  the  long-dashed  lines  show  the 
electrostatic  (no-retardation)  results,  dash-dotted  lines  the  electrostatic  results  for  the  exciton  dispersion  in  vacuum. 


a  r  K3Oe~yd 

~iVl{k,q)  =  2  /  dK  (22) 

P  jq  y{K--q-yu 

and  similarly  for  Eqs.  (14)  and  (15).  The  step  functions  in 
Eqs.  (20)  and  (21),  as  is  also  reflected  in  Eq.  (22),  have  a 
clear  physical  significance  of  the  energy-and-momentum 
conservation  limitation. 

It  is  also  useful  and  meaningful  to  note  the  no-retardation 
limit  (k= 0)  of  the  above  expressions,  when  Q  in  Eq.  (17) 
becomes  equal  to  the  electrostatic  combination  (3),  and  the 
exciton  dispersion  would  be  affected  by  the  image  dipoles  as 

a  r  e~Kd 

— =  2  q2  - T-^clK  =  2q1KQ(qd),  (23) 

yo P  j  q  \K  - 1  ) 

-^Xsy(q)  =  2f  (k2  -  q2)ll2e~Kdd/<  =  2jKl(qd) ,  (24) 
QoP  Jq  d 

and 


Xl(q)=VM  +  Vy(q),  (25) 

where  Kq(x)  and  K ,  (x)  are  the  modified  Bessel  functions. 

The  effects  of  these  real  image  corrections  are  displayed 
in  Fig.  4  showing  an  example  of  the  total  //-dependent  exci¬ 
ton  self-energy  %(k,q)  including  both  vacuum  and  secondary 
field  contributions.  The  figure  is  a  result  of  an  illustrative 
calculation  for  an  idealized  (no  dissipation)  system  in  the 
resonance  region  (e=-l.l  so  that  Kp  —  3.3k)  at  a  relatively 


close  {d=  10a)  distance  from  the  interface — to  demonstrate  a 
possible  magnitude  of  the  effects.  The  figure  shows  both  the 
retardation  effects  as  well  as  the  fact  that  upon  the  increase 
of  the  wave  vector  q,  the  exciton  dispersion  approaches  the 
electrostatic  behavior.  It  is  clear  that,  in  principle,  the  elec¬ 
trostatic  image  forces  may  appreciably  modify  the  exciton 
dispersion. 

In  an  idealized  system,  the  secondary  field  contributions 
to  the  imaginary  part  of  the  self-energy  (exciton  decay)  can 
come  only  from  two  sources.  First,  it  is  the  regions  of  vari¬ 
able  k  in  the  integrals  where  the  parameter  y,  Eqs.  (18),  is 
imaginary.  As  Eq.  (22)  shows,  that  happens  only  for 
q  <  k — this  is  the  source  of  the  modification  by  the  substrate 
of  the  exciton  decay  into  vacuum  photons  that  we  explicitly 
illustrated  in  Fig.  1.  The  second,  and  dominating  in  Fig.  4, 
source  is  the  surface  plasmon  pole  k=kp  in  the  parameter  Q 
in  the  integrals.  As  Eq.  (22)  shows  again,  this  pole  contrib¬ 
utes  only  for  q  <  Kp.  signifying  the  decay  of  the  exciton  into 
a  SP.  The  pole  contributions  to  -Im{X(£,//)}  are  immediately 
calculable  and  can  be  represented  by  a  combination  of  a 
common  factor 


47 T 


.11/2 


- — E  exp[-  d(K2  -  k2)ll2~\ 


( 1  -  e)  ak ' 

and  polarization-  and  //-dependent  cofactors 

-2  /„  /  „2  7\1I2 


q2/xp(K2-q2y 


for  x  polarization. 


(26) 


(27) 
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FIG.  5.  Decay  width  T(£,g)  of  the  exciton  polariton  as  a  function  of  the  reduced  wave  vector  q/k  for  a  fixed  value  of  k  and  e=-0.8.  Two 
rows  correspond  to  different  dipole-to-image-dipole  distances:  upper  panels  (a)-(c)  are  for  t/=600n,  lower  panels  (d)-(f)  for  d=  10 a.  There 
is  no  surface  plasmon  channel  here,  and  the  total  decay  width  is  due  to  the  vacuum  photons  only. 


(K;-q2)m/Kp  (28) 

for  y,  and 

M  y(^-r)1/2  (29) 

for  z. 

Accompanying  decay’s  inverse  square-root  singularity  in 
Eqs.  (27)  and  (29),  also  seen  in  Figs.  4(d)  and  4(f),  are  the 
diverging  discontinuities  in  the  real  parts  [panels  (a)  and  (c) 
of  Fig.  4]  of  the  type  familiar  from  the  studies  of  the  decay 
of  2D  excitons  into  3D  vacuum  photons.1'’'24-2'’  Due  to  such 
a  discontinuity,  the  resulting  self-consistent  dispersion  of, 
e.g.,  x-polarized  excitations  would  split  into  two  branches  as 
shown  in  Fig.  3.  On  the  contrary,  the  decay  rate  of 
y-polarized  excitons  vanishes  at  Kp  in  Eq.  (28).  This  is  a 
consequence  of  the  polarization  of  SPs  whose  electric  field 
can  have  only  longitudinal  and  perpendicular  to  the  interface 
components  (like  /^-polarized  photons).  As  q  —>  Kp,  the  sur¬ 
face  plasmons  would  be  emitted  along  the  chain  direction; 
hence,  their  field  would  have  no  y  components  to  interact 
with  y  excitons.  [There  is  a  similarity  here  with  the  decay  of 
x-polarized  excitons  in  vacuum,  whose  decay  rate  vanishes 
at  q^k,  Eq.  (1).]  Correspondingly,  the  real  part  of  the  self¬ 
energy  in  Fig.  4(b)  does  not  exhibit  a  discontinuity  at  q=Kp 
(the  discontinuity  is  in  the  derivative)  with  the  resulting  ex¬ 
citon  spectrum  consisting  of  one  branch  only.  Quite  clear  is 
also  <*q2  behavior  for  x  excitons  in  Eq.  (27);  it  has  the  same 
origin  as  in  the  electrostatic  effect  (23) — at  q—> 0,  the  exciton 


polarization  of  the  chain  becomes  uniform  and  there  would 
be  no  polarization  charges  (a  vanishing  spatial  derivative)  to 
induce  images  in  the  substrate. 

The  SP  excitations  in  the  substrate  exist  in  the  region  of 
frequencies  where  e(co)<-l.  Qualitatively  different,  there¬ 
fore,  for  a  reflecting  substrate  is  another  region  of  frequen¬ 
cies  in  which  -1  <  e(co)  <  0.  In  this  case  exciton  polaritons  in 
the  molecular  chain  can  decay  only  into  vacuum  photons. 
The  (idealized)  substrate  then  serves  as  to  modify  the  disper¬ 
sion  of  excitons  and  their  radiative  decay.  An  example  of  the 
radiative  decay  modification  is  shown  in  Fig.  5  calculated  for 
£=-0.8  and  to  be  compared  with  Fig.  1.  Both  figures  feature 
the  same  distances  from  the  interface  but  different  magni¬ 
tudes  and  signs  of  the  image  charges,  Eq.  (3).  These  two 
facts  explain  the  similarities  and  differences  in  the  radiative 
decay  patterns  for  the  two  examples.  Among  the  important 
common  features,  one  should  notice  “oscillating”  q  depen¬ 
dences  of  the  decay  rate  at  larger  distances  and  the  vanishing 
of  the  radiative  decay  at  q^k.  As  the  chain  is  moved  farther 
away  from  the  interface,  even  more  undulations  would  be 
observed  in  the  q  dependence  of  the  decay  rate  as  a  result  of 
the  interference  with  the  image  dipoles. 


IV.  EFFECTS  OF  DISSIPATION  IN  THE  SUBSTRATE 

Real  metallic  (dielectric)  substrates  are  characterized  by 
finite  dissipation  (losses)  that  is  taken  into  account  phenom¬ 
enologically  via  the  imaginary  part  of  the  dielectric  function: 
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FIG.  6.  Decay  width  F(k,q)  of  exciton  polaritons  as  a  function  of  the  reduced  wave  vector  q/k  for  a  fixed  value  of  k=2irl 400a  in  units 
of  vacuum  Tv  =  2Trp2k2  /  a  and  for  different  (indicated  in  the  panels)  exciton  polarizations.  Here  substrate’s  dielectric  constant  is  complex. 
e=  — 1. 2+0.05/.  Each  of  the  panels  contains  four  curves  corresponding  to  different  distances  between  the  molecular  chain  and  the  substrate 
(top  to  bottom):  parameter  rf=10a,  20a,  30 a,  and  50a. 


e(w)  =  e'(a> )  +  ie"((o). 

In  the  context  of  the  effects  of  e"  on  SPs,  for  a  given  real 
frequency  w,  wave  vectors  of  the  SPs  consequently  acquire 
imaginary  parts  as  well:10 

Kp  =  «p  +  *  V 

where  k"  describes  the  damping  of  the  SP  modes.  Another 
way  to  look  at  /c"  is  as  of  the  uncertainty  (broadening)  of  the 
SP’s  wave  vector.  Hence  the  momentum  conservation  law  in 
the  exciton  decay  into  a  SP  would  not  be  obeyed  exactly  as 
in  idealized  systems  discussed  so  far.  One  thus  should  expect 
a  broadening  and  extension  of  the  range  of  finite  exciton 
decay  rates  into  the  region  of  the  exciton  wave  vectors  q 
>/Cp.  Moreover,  even  with  relatively  small  losses,  e"<g|e'|, 
the  relative  damping  of  the  SP  can  become  substantial  in  the 
resonance  region  of  e '  close  to  -1.  Indeed,  in  this  case,10 


kpIkp  “ 


c"/g' 

2(6' +  1)’ 


(30) 


pecially  illuminating  in  this  regard  is  a  second  maximum  on 
the  topmost  curve  in  Fig.  6(a).  Indeed,  similarly  to  the 
image-charge  effects  already  discussed  in  Sec.  Ill,  the  elec¬ 
trostatic  component  of  the  electric  field  of  x-polarized  exci- 
tons  has  a  q  and  d  dependence  reflected  on  the  rhs  of  Eq. 
(23),  the  maximum  of  which  is  achieved  at  q~lld.  As  the 
distance  from  the  interface  increases,  the  role  of  such  short- 
range  energy  transfer  from  the  higher-^  excitons  to  the  sub¬ 
strate  decreases  and  one  can  “tune”  it  off  while  still  having 
an  appreciable  decay  rate  of  the  lower-#  states  into  SPs.  In 
addition  to  the  largest-distance  curves  in  Fig.  6,  this  point  is 
illustrated  in  Fig.  7,  showing  clear  SP  emission  effects  both 
close  to  and  away  from  the  resonance  region.  The  distance 
dependence  of  the  SP  emission  intensity  is  governed  by  the 
factor  (26),  and  one  could  likely  “optimize”  the  relationship 
between  the  coherent  and  incoherent  energy  transfer  based 
on  the  system  regime  parameters. 


V.  DISCUSSION 


and  the  denominator  on  the  right-hand  side  (rhs)  of  Eq.  (30) 
could  “compensate”  for  the  smallness  of  e"  and  completely 
destroy  the  notion  of  a  coherent  SP. 

Figure  6  shows  results  of  the  calculation  of  the  exciton 
decay  for  the  substrate  with  the  dielectric  constant  e = —  1 .2 
+  0.05/  and  variable  distances  to  the  interface.  As  compared 
to  the  idealized  system,  one  immediately  notices  very  broad, 
especially  at  smaller  distances,  q  >  regions  of  the  exciton 
decay.  For  this  particular  value  of  the  complex  dielectric 
function,  however,  the  “uncertainty”  ratio  (30)  is  only  about 
0.1.  Thus  the  extremely  broadened  q  region  of  the  exciton 
decay  is  not  due  to  the  broadening  of  the  SPs  themselves;  the 
latter  can  explain  only  the  formation  of  the  finite-magnitude 
SP  peaks  clearly  seen  in  Fig.  6.  Of  course,  with  complex 
dielectric  constants,  the  imaginary  part  of  the  exciton  self¬ 
energy  discussed  in  Sec.  Ill  is  contributed  to,  in  principle,  by 
the  whole  integration  range  in  integrals  like  Eq.  (22),  rather 
than  just  by  the  around-the-pole  region.  What  is  reflected  in 
the  broad  “wings”  in  Fig.  6  is  the  result  of  the  ordinary 
“incoherent”  Joule  losses  due  to  the  oscillating  electric  field 
of  the  exciton  polarization  penetrating  into  the  substrate.  Es- 


The  modification  of  the  electric  fields  in  the  presence  of 
the  reflecting  substrate  leads  to  substantial  changes  of  the 
properties  of  ID  exciton-polaritons  in  a  dipole-coupled  chain 
above  the  substrate.  The  substrate  modifies  the  interaction  of 
the  exciton  with  vacuum  photons  and,  in  addition,  can  en¬ 
gage  a  new  and  efficient  interaction  channel — with  surface 
excitations  of  the  substrate.  While  these  statements  also  ap¬ 
ply  to  a  well-studied19,20  case  of  a  single  molecule  near  the 
interface,  the  cooperative  interaction  of  many  molecules  in 
the  chain  and  of  their  images  may  lead  to  qualitatively  and 
quantitatively  interesting  effects.  In  this  paper,  we  discussed 
such  effects  for  coherent  delocalized  ID  exciton  polaritons 
that  are  well  characterized  by  their  wave  vectors  q.  We  em¬ 
phasize  that  it  is  actually  individual  q  states  that  would  ex¬ 
hibit  qualitatively  different  coherent  properties — if  averaged 
uniformly  over  all  q,  a  (near)  reduction  would  occur  to  the 
effects  of  the  substrate  on  an  individual  molecule  [similarly 
to  discussed  Eq.  (10)  for  a  vacuum].  Immediately  noticeable 
in  our  illustrative  examples,  for  instance,  is  a  difference  of 
the  behavior  of  x-  and  y-polarized  ID  exciton  polaritons — 
while  those  cases  of  the  molecular  transition  dipoles  parallel 
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q/k 


q/k 


q/k 


FIG.  7.  As  in  Fig.  6  but  for  d=50a  and  two  different  dielectric  functions  (top  to  bottom):  e=-2.0+0.1z  and  e=-10. 0  +  0.5;. 


to  the  interface  would  be  equivalent  for  single  molecules. 
The  properties  of  ID  exciton  polaritons  depend  specifically 
on  their  polarization,  frequency  with  respect  to  the  substrate 
dielectric  dispersion,  and  distance  from  the  interface.  We  re¬ 
iterate  that,  while  our  illustrations  used  parameters  appropri¬ 
ate  for  molecular  systems,  qualitatively  similar  effects  are 
expected  for  other  systems  such  as  chains  of  nanoparticles. 

A  common  feature  for  all  exciton  polarizations  and  con¬ 
ditions  considered  (see  Figs.  1,  2,  and  5)  is  that  the  rate  of 
the  radiative  decay  into  vacuum  photons  in  the  presence  of 
the  substrate  vanishes  as  |</|  approaches  k  =  a>/c  from  the 
lower  side — that  is,  where  the  exciton  dispersion  curve 
would  cross  the  vacuum  photon  dispersion  line  (see,  e.g.. 
Fig.  3).  This  is  distinctly  different  from  the  case  of  the  chain 
in  the  vacuum  where  the  radiative  decay  rate  (1)  generally 
remains  finite  at  |g|  =k-0  and  discontinuously  goes  to  zero  at 
|z/|  =  /r+0;  in  vacuum,  only  excitons  polarized  parallel  to  the 
chain  exhibit  continuous  vanishing  at  |^|=A:.  The  conse¬ 
quence  of  this  effect  of  the  image  dipoles  is  that,  near  the 
substrate,  no  branch  splitting  occurs  for  the  chain  polaritons 
at  |</|  =  £.  At  distances  from  the  interface  comparable  to  the 
wavelength  2ir/k  of  vacuum  photons,  the  interference  with 
the  radiative  fields  of  the  image  dipoles  results  in  undulated 
patterns  of  the  radiative  decay  [panels  (a)-(c)  of  Figs.  1  and 
5]  as  functions  of  q;  the  larger  the  distance,  the  more  undu¬ 
lations  would  take  place. 

It  is  important  to  note  that,  as  a  result  of  integration  (19) 
over  many  molecules  in  the  chain,  the  magnitudes  of  these 
and  some  other  ^-dependent  features  fall  off  with  distance 
from  the  interface  slower  than  their  averages  characteristic  of 
single  molecules.  Consider,  e.g.,  the  electrostatic  exciton  en¬ 
ergy  shifts  due  the  interaction  of  the  molecular  dipole(s)  with 
their  image(s).  For  a  single  molecule,  the  corresponding  tran¬ 
sition  energy  shift  is  ~p2/di.  The  maximum  of  ^-dependent 
shifts,  however,  as  Eqs.  (23)-(25)  show,  would  be 
~p2/ad2 — that  is,  much  larger  at  distances  cl>  a  (not  in  the 
immediate  proximity  of  the  interface).  This  may  provide  an 
opportunity  for  easier  experimental  identification  of  such 
shifts  than  for  individual  molecules. 

While  the  distance  dependence  (26)  of  the  decay  rate  of 
the  excitons  into  surface  plasmons  is  the  same  as  for  single 


molecules,  ^-dependent  enhancing  factors  (27)  and  (29), 
seen  as  the  peaks  in  our  illustrations,  may  facilitate  better 
experimental  verifications.  As  the  rate  of  the  emission  of  SPs 
is  q  dependent,  we  speculate  that  the  chain  excitons  could 
perhaps  serve  as  directional  sources  of  SPs — in  expectation 
of  a  more  efficient  emission  along  the  chain  for  x-  and 
z-polarized  excitons  and  perpendicular  to  the  chain  for 
y-polarized  excitons.  We  recall  that  these  polarization  assign¬ 
ments  stem  from  the  fact  that  the  electric  field  of  the  SPs  lies 
in  the  plane  made  by  the  wave  vector  and  the  normal  to  the 
interface.  As  the  small-g  exciton  decay  into  SPs  can  be 
strongly  enhanced  in  comparison  with  the  decay  into  vacuum 
photons,  it  is  likely  that  the  inverse  process  of  the  excitation 
of  the  chain  by  SPs  could  also  be  exploited.  Nonemissive 
exciton  polaritons  with  larger  q  may  also  present  an  oppor¬ 
tunity  to  be  used  for  SP  guiding.23 

Various  scattering  and  dissipation  processes  are  known  to 
be  able  to  strongly  affect  features  characteristic  of  idealized 
systems.  We  particularly  discussed  the  chain  exciton  quench¬ 
ing  due  to  the  “incoherent”  energy  transfer  to  the  substrate — 
and  this  does  not  exhaust  the  list.  It  suffices  to  also  mention, 
e.g.,  the  scattering  by  phonons  in  molecular  chains  or  Joule 
losses  in  the  chains  of  metallic  nanoparticles.  The  fast  scat¬ 
tering  between  different  q  states  would  result  in  the  thermal- 
ized  population  of  the  exciton  polaritons  so  that  the  observed 
decay  rates  are  thermally  averaged  (see  Refs.  7,  9,  and  27  for 
some  specific  ID  applications).  We  therefore  presume  that 
the  best  conditions  to  experimentally  address  finer 
^-dependent  effects  we  discussed  in  this  paper  would  be  low- 
temperature  spectroscopic  measurements. 
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Single-molecule  renormalization  is,  to  a  large  extent,  not  impor¬ 
tant  for  our  interest  in  this  paper,  and  we  refer  the  reader  to 
numerous  studies  dedicated  to  it.  We  will  completely  disregard 
the  vacuum  real  energy  shift  of  a  single  molecule  which  can  be 
thought  of  as  absorbed  in  Ee.  By  the  same  token,  we  do  not 
discuss  any  modifications  of  Ee  in  the  presence  of  the  interface 
that  may  take  place  due  to  the  interactions  outside  of  our  scope. 
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Due  to  solvation,  excess  charge  carriers  on  Id  semiconductor  nanostructures  immersed  in  polar 
solvents  undergo  self-localization  into  polaronic  states.  Using  a  simplified  theoretical  model  for  small- 
diameter  structures,  we  study  low-frequency  dynamical  properties  of  resulting  Id  adiabatic  polarons.  The 
combined  microscopic  dynamics  of  the  electronic  charge  density  and  the  solvent  leads  to  macroscopic 
Langevin  dynamics  of  a  polaron  and  to  the  appearance  of  local  dielectric  relaxation  modes.  Polaron 
mobility  is  evaluated  as  a  function  of  system  parameters.  Numerical  estimates  indicate  that  the  solvated 
carriers  can  have  mobilities  orders  of  magnitude  lower  than  the  intrinsic  values. 

©  2008  Elsevier  B.V.  All  rights  reserved. 


1.  Introduction 

One-dimensional  (fd)  semiconductor  (SC)  nanostructures  like 
nanowires  and  nanotubes  are  fascinating  objects  offering  a  promise 
for  various  applications,  including  electronics,  energy  harvesting 
and  sensors.  The  properties  of  excess  charge  carriers  are  funda¬ 
mental  for  these  applications.  A  customary  picture  used  to  de¬ 
scribe  charge  carriers  in  nanowires  [1]  and  nanotubes  [2]  is  that 
of  band  electron  states  with  very  high,  nearly  ballistic  [3],  intrinsic 
mobilities.  Many  applications,  however,  such  as  photoelectrochem¬ 
ical  energy  conversion  [4,5],  feature  SC  nanostructures  in  contact 
with  polar  liquid  environments.  We  have  recently  pointed  out  [6, 
7]  that  excess  carriers  on  Id  SCs  immersed  in  the  sluggish  polar 
medium  can  drastically  change  their  nature.  Due  to  the  long-range 
Coulomb  interaction  with  the  slow  (orientational)  polarization  of 
the  medium,  a  charge  carrier  (an  electron  or  a  hole)  gets  solvated 
forming  a  localized  electronic  state  surrounded  by  a  self-consistent 
dielectric  polarization  pattern  as  shown  schematically  in  Fig.  1.  In 
physical  parlance,  self-trapped  states  formed  by  this  mechanism 
are  traditionally  called  polarons,  extensively  studied  in  3d  ionic 
crystals  and  polar  SCs  [8,9].  Another  related  3d  notion  is  that  of 
the  solvated  electron  in  liquids  [10-12]. 

The  formation  of  Id  polarons  is  expected  to  have  an  impact 
on  various  processes  such  as  electron-hole  separation  from  bound 
excitonic  states  [6,7],  charge  transfer  reactions,  local  optical  ab¬ 
sorption  [13]  and  charge  carrier  transport.  As  we  discuss  the  low- 
frequency  polaron  dynamics  in  this  Letter  within  a  simplified  theo¬ 
retical  framework,  it  will  be  particularly  illustrated  that  the  mobili¬ 


*  Corresponding  author. 

E-mail  address:  yuri.gartstein@utdallas.edu  (Yu.N.  Gartstein). 

0375-9601/$  -  see  front  matter  ©  2008  Elsevier  B.V.  All  rights  reserved. 
doi:10.1016/j.physleta.2008. 07.031 


ties  of  the  solvated  carriers  can  be  orders  of  magnitude  lower  than 
the  intrinsic  values.  We  note  that  the  system  we  consider  appears 
an  interesting  model  realization  in  the  general  context  of  quantum 
particles  interacting  with  a  dissipative  environment  [14].  While  we 
restrict  our  discussion  to  rigid  SC  nanostructures  with  good  intrin¬ 
sic  electronic  properties  and  favoring  the  formation  of  continuum 
adiabatic  polarons,  the  generic  aspects  apply  to  other  systems  as 
well.  Basko  and  Conwell  [15]  emphasized  a  conceptually  similar 
formation  of  hole  polarons  in  DNA  in  aqueous  electrolyte  solutions 
and  discussed  the  effect  of  water  drag  on  their  mobility  [16], 

The  parametric  scales  for  the  spatial  extent  and  the  binding  en¬ 
ergy  of  the  3d  polaron  are  established  by  the  3d  Coulomb  center 
problem  where,  for  a  carrier  of  charge  q  and  effective  mass  m  in 
the  uniform  medium  of  effective  dielectric  constant  e*.  the  corre¬ 
sponding  3d  Bohr  radius  and  Rydberg  are1 

i iB=e*h2/mq 2,  Ry  =  q2/2e*aB-  (1) 

As  a  result  of  the  confinement  of  charge  carrier  motion  to  the  Id 
SC,  the  effects  of  Coulomb  interactions  are  generally  known  [17] 
to  get  amplified,  and,  in  fact,  the  binding  energy  £jJo1  of  the  con¬ 
fined  polaron  can  increase  substantially  in  comparison  with  the  3d 
binding  [6,7],  Our  interest  is  in  the  strong  confinement  regime: 
OB  S>  R  (R  is  the  transverse  radius  of  the  Id  structure),  when  the 
resulting  longitudinal  spatial  extent  of  the  polaron  is  much  larger 
than  R  (see  Fig.  2(b))  and  the  self-localized  charge  carrier  can  be 
adequately  described  by  the  Id  wave  function  t/r(x),  x  being  the 
coordinate  along  the  structure  axis. 


1  As  we  do  not  compare  polarons  and  excitons  in  this  Letter,  our  definitions  here 
are  based  on  the  electron  mass  rather  than  on  the  exciton  reduced  mass  used  in 
Refs.  [6,7], 
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Fig.  1.  Schematically:  Solvated  excess  electron  charge  density  (thick  line)  along  the 
Id  semiconductor  nanostructure  which  is  surrounded  by  a  pattern  of  the  orienta¬ 
tional  polarization  of  solvent  molecules. 

We  consider  the  polaron  to  be  formed  due  to  the  interaction 
of  an  excess  carrier  with  the  slow  polarization  component  of  the 
Debye  solvent  described  by  the  frequency  cu-dependent  dielectric 
function 

.  .  ,  ^s  foo 

e(a>)  —  Coo  +  ; - : - j  (2) 

1  —  IOITd 

for  which  the  effective  dielectric  constant  in  Eq.  (1)  would  be  de¬ 
termined  from  the  well-known  [8,11,12]  relationship 

1/e*  =  l/^oo  —  l/e,,  (3) 


2.  General  adiabatic  polaron  relationships 

In  the  adiabatic  picture,  the  excess  charge  carrier  instanta¬ 
neously  responds  to  the  electrostatic  potential  tp(x,  t )  that  is  slowly 
changing  in  time  by  staying  in  the  lowest  eigenstate  of  the  station¬ 
ary  Schrbdinger  equation: 

h2  32 

Hf(x,t)  =  Et(x,t),  H-—  I  q<l>(x,l),  5 

2m  dx2 

where  time  t  thus  enters  only  as  a  parameter.  The  corresponding 
time-dependent  Id  charge  density  is 

p(x,t)  =q|yk(x,tj|2,  (6) 

where  q  is  the  charge  of  the  carrier. 

The  electrostatic  potential  cj>(x,t),  in  turn,  can  be  represented 
as  a  sum  of  two  contributions: 

<j>{x,t)  =  4n{x,t)  +  4>s{x,t),  (7) 

where  the  first  term  represents  the  potential  “independent”  of  the 
charge  carrier  itself,  that  is,  the  potential  that  would  be  there  if 
the  charge  carrier  was  absent— as  exemplified  by  applied  fields  and 
dielectric  fluctuations  of  the  environment.  The  second  term  in  (7) 
is  the  part  that  is  induced  by  the  carrier  charge  density  (6)  in  the 
past  via  the  time-dependent  medium  response: 

OO 

<ps(x,  t)  =  J  dt'  J  dx!  Q(x' ,t')p(x  —  x' ,t  —  t').  (8) 

o 

Eq.  (8)  assumes  the  translationally  invariant  response  so  that 


with  es  3>  Coo  for  typical  solvent  values  of  static  es  and  high- 
frequency  Coo  constants  [18].  The  characteristic  time  r d  in  Eq.  (2) 
is  the  Debye  (transverse)  relaxation  time,  for  the  problem  at  hand, 
however,  the  solvent  response  occurs  at  generally  much  shorter 
time  scales  corresponding  to  the  longitudinal  relaxation  time  [12, 
18,19] 


D.  = 


(4) 


Typical  solvents  are  characterized  by  a  wide  range  of  %i  ranging 
from  fractions  to  tens  of  ps  [6,18].  With  the  estimated  polaron 
binding  energies  E^01  on  the  order  of  0.1  eV,  the  dynamic  adi- 
abaticity  is  ensured  by  fi/r^  <SC  E^01.  For  room  temperatures  T, 
on  the  other  hand,  one  can  also  have  I<bT  <g  E £o1  satisfied,  thus 
making  thermal  population  of  higher  energy  electronic  states  neg¬ 
ligible. 

The  aim  of  this  Letter  is  to  elucidate  the  response  of  such  Id 
polarons  to  weak  perturbations  of  low-frequencies  a> :  hco  <K  E^°\ 
which  do  not  cause  transitions  between  different  electronic  lev¬ 
els.  This,  in  particular,  encompasses  such  basic  phenomena  as 
polaron  drift  caused  by  an  applied  field  and  polaron  diffusion 
caused  by  the  dielectric  fluctuations  of  the  surrounding  solvent. 
We  stress  that  our  consideration  is  limited  to  a  simplified  theo¬ 
retical  model  whose  main  purpose  is  to  address  elements  of  the 
essential  physics.  We  will  show  how  the  underlying  combined  mi¬ 
croscopic  dynamics  of  the  electronic  charge  density  on  the  Id  SC 
and  the  solvent  results  in  point-particle-like  Langevin  dynamics 
of  the  polaron  center.  The  derived  mobility  and  diffusion  coeffi¬ 
cient,  as  should  be  expected  [14],  satisfy  the  Einstein  relation.  In 
addition,  we  demonstrate  the  existence  of  local  relaxation  modes 
around  the  polaron  whose  relaxation  times  are  somewhat  longer 
than  bare  solvent’s  r l.  These  modes  involve  polaron  shape  varia¬ 
tions  and  presumably  could  manifest  themselves  in  responses  to 
time-dependent  fields  as  well  as  in  collisions. 


0s(k,  co)  =  Q(k,  co)p(k,  co). 


in  Fourier  space.  The  response  function  Q  is  determined  from  the 
solutions  of  the  corresponding  3d  electrostatic  potential  problem 
using  the  appropriate  geometry  of  elementary  charge  distributions 
and  of  the  dielectric  environment.  Let  us  denote  such  a  solution 
derived  for  a  particular  value  e(co)  of  the  medium  dielectric  func¬ 
tion  as  V (k,  e(&>)).  One,  of  course,  recognizes  that  Q  should  not  in¬ 
clude  the  instantaneous  Coulomb  interaction  of  the  charge  carrier 
with  itself;  in  the  context  of  the  Debye  solvent  (2),  the  medium 
for  this  case  is  characterized  by  Coo.  In  these  terms, 

G(k,  co)  =  V(k,  e  (to))  -  l 7  (ft.  Coo),  (9) 

the  relevant  response  is  thus  due  to  the  slow  (orientational)  com¬ 
ponent  of  the  dielectric  polarization  of  the  medium.  Explicit  ex¬ 
amples  of  such  calculations  for  the  nanotube  geometry  with  the 
account  of  tube’s  own  polarizability  can  be  found  in  Ref.  [6[.  When 
the  dielectric  polarization  effects  are  well  captured  by  the  model 
of  a  uniform  3d  medium,  which  is  a  good  approximation  in  the 
strong  confinement  regime  [6,20],  the  response  Q  is  separable 
into  a  product  of  the  spatial  and  temporal  components  (all  bare 
Coulomb  interactions  are  simply  decreased  by  a  factor  of  the  in¬ 
verse  dielectric  constant).  We  will  write  this  as 

G(x,t)  = -A(t)G(x)  or  G(kco)  = -ot(co)g(k).  (10) 


To  gain  more  physical  insight  and  obtain  explicit  simpler  relation¬ 
ships,  we  will  restrict  our  attention  in  this  Letter  to  this  regime.  In 
accordance  with  Eq.  (9),  the  temporal  part  for  the  Debye  solvent 
(2)  can  be  described  as 


1 

ot(w)  =  — 

6oo 


1 

e(co) 


otp 

1  —  iwtL  ’ 


(H) 


The  relaxation  time  tl  that  determines  the  pole  of  the  inverse  di¬ 
electric  function  as  well  as  the  familiar  combination  (3)  naturally 
arise  here.  As  is  well  known  [12,18,19],  to  characterizes  the  di¬ 
electric  relaxation  in  circumstances  controlled  by  electrostatic  po¬ 
tentials  (electric  fields  E)  while  r i  is  appropriate  in  circumstances 
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controlled  by  charge  densities  (electric  inductions  D).  Evidently, 
the  latter  is  the  relevant  situation  here,  and  we  will  use  r  =  Tl 
to  simplify  the  notation  in  what  follows.  In  the  time  domain,  the 
response  function  is  therefore 

A(t)  =[—  e~imta(co)  =  —  e~^T0(t),  (12) 

J  2jt  x 

where  0  is  a  step  function. 

The  spatial  part  G(x)  of  the  response  (10)  is  determined  by  the 
specifics  of  the  transverse  charge  distribution  in  our  Id  SC.  For  the 
illustrative  calculations  below,  we  will  use  the  case  of  the  nan¬ 
otube  geometry  where  (see,  e.g.,  Refs.  [6,21]) 

g(k)  =  2I0(kR)I(o(kR),  (13) 

with  / o  and  K o  being  the  modified  Bessel  functions  and  R  the 
tube  radius.  Equation  (13)  essentially  corresponds  to  the  Fourier 
transform  of  the  Coulomb  interaction  between  two  charged  rings 
of  radius  R. 

Taking  advantage  of  the  particular  simple  form  (12),  the  tempo¬ 
ral  evolution  of  the  potential  cf>(x,  t),  as  per  Eqs.  (7)  and  (8),  can  be 
conveniently  converted  into  the  differential  (over  time)  equation: 


9</>(x,  V  4>(*A) 

3 1  r 

_  3 </>,  (*.  0  </><  (*.  f) 

3 1  x 


ao 

X 


J  dx'  G(x  —  x')p(x',  t). 


(14) 


3.  The  stationary  solution  and  small  perturbations  around  it 


One  first  discusses  the  dynamics  in  the  absence  of  the  external 
potential  <pi(x,t),  and  the  starting  point,  of  course,  is  the  con¬ 
sideration  of  the  ground-state  stationary  polaron  problem.  In  this 
basic  case,  the  resulting  charge  density  and  potential  are  time- 
independent,  we  denote  their  spatial  distributions  as  p(x)  and 
4>(x),  respectively.  Their  relationship  is  determined  by  the  static 
medium  response,  or  a(0)  =  cto  in  Eqs.  (10),  (11).  That  is  also  im¬ 
mediately  seen  from  Eq.  (14): 

4>(x-x0)  =  -a0  J  dx'G(x-x')p(x'-x0),  (15) 

where  we  explicitly  emphasized  the  translational  invariance  by 
indicating  that  the  stationary  solutions  can  be  centered  around  ar¬ 
bitrary  points  xq.  (The  negative  sign  in  the  r.h.s.  of  Eq.  (15)  and 
in  other  expressions  below  signifies  that  a  positive  charge  density 
leads  to  a  stabilizing  negative  potential,  and  vice  versa.)  As  p(x)  is 
determined  (6)  by  the  ground-state  charge  carrier  wave  function 
obeying  Eq.  (5),  the  latter  wave  function  is  then  found  from 
the  self-consistent  non-linear  Schrodinger  equation 


polarons.  The  standard  lowest-order  perturbation  theory  with  re¬ 
spect  to  V  then  results  in  the  perturbed  electron  density  p(x,  t )  = 
p(x )  +  p(x,  t)  with 

p(x,t)  =  —  J  dx'  L(x,x!)4>(x! ,t),  (16) 

where  the  Hermitian  kernel 


L(x,x')  =  q2J2 
!>0 


fS(.x)jrtW1rf(x!)1r  o(x')  +  c.c. 
Ei  -  Eq 


is  built  out  of  the  solutions  of  the  unperturbed  problem 

Hti{x)  =  Eiiri(x). 


The  density  perturbation  (16)  should  self-consistently  determine 
the  potential  perturbations  (8)  at  future  times,  that  is: 

OO 

0(x,  t)  =  jdt'fdx'g(x\t')p(x-x\t-t').  (17) 

o 

Together,  Eqs.  (16)  and  (17)  determine  the  eigenmodes  of  the  un¬ 
driven  dynamics  around  the  stationary  polaron  in  the  case  of  a 
general  response  Q.  In  the  separable  case  (10),  the  problem  of  find¬ 
ing  eigenmodes  evidently  reduces  to  a  simpler  generalized  eigen¬ 
value  problem  which  we  write  in  the  form  of 


Pn(x)  =  - J  dx' L(x,x')<pn(x'),  (18a) 

cf>n(x)  = -kna o  J  dx'  G(x  —  x')pn(x').  (18b) 

In  practical  terms,  the  generalized  problem  (18)  is  easily  con¬ 
verted  to  a  normal  eigenvalue  problem  and  readily  solved  in 
Fourier  (wave  vector)  space.  The  eigenfunctions  of  this  problem 
form  a  complete  orthogonal  set.  To  save  space  on  notation,  here 
we  choose  to  normalize  this  set  without  extra  coefficients  as 

—  J  dx  pm(x)cpn(x)  =  &nm  (19) 

(all  functions  can  be  chosen  real).  The  eigenvalues  kn  >  1  (n  ^  0), 
in  turn,  determine  the  eigenfrequencies  a>„  from 


ot(a>„)  =  Xnoi0. 


(20) 


Unlike  the  oscillating  solutions  in  Refs.  [22-25],  for  the  Debye 
solvent  with  its  dissipative  dynamics  (11),  Eq.  (20)  leads  to  a  set  of 
relaxation  times  r„: 


i 

con  = - . 

xn 


x 

=  1-1/V 


(21) 


ft2  d2\k(x)  q2  E  ,  ,i  ,  ,2 
~2m~dx 2 - —  J  dx  G(x-x)\xlr(x)\  f(x)  =  Ef{x) 

as  corresponding  to  its  lowest  eigenvalue  E  —  E o.  This,  of  course, 
is  the  same  equation  one  would  derive  from  the  optimization  of 
the  adiabatic  system  energy;  it  results  in  self-localized  (polaronic) 
solutions  as  we  discussed  in  Refs.  [6,7]. 

With  the  ground-state  polaron  solution  in  place,  one  can 
now  consider  the  effects  of  small  potential  perturbations  4>(x,t): 
<p(x ,  t)  =  4>(x)  +  </>(x,  f),  in  the  stationary  Schrodinger  problem  (5): 
H  =  H  +  V, 

h2  d2 

H  =  -  — ^ +q0(x),  V  =  q</>(x,t). 

The  overall  approach  here  is  very  similar  to  used  in  previous  stud¬ 
ies  of  small  perturbations  around  3d  [22]  and  Id  [23-25]  optical 


The  temporal  evolution  of  the  eigenfunctions  of  the  dynamical 
problem  would  therefore  be  “ordinarily”  decaying: 

pn(x,  t)  oc  pn(x)e~c/Tn . 

Fig.  2(a)  illustrates  how  these  relaxation  times  r„  converge  to  the 
“bare”  value  of  r  upon  the  increase  of  the  mode  index  n— for  this 
illustrative  calculation  we  used  the  tubular  case  (13)  with  the  con¬ 
finement  parameter  afl/R  =  10.  One  immediately  notices,  however, 
that  the  dynamics  has  a  very  different  character  for  the  mode 
with  Ao  =  1  corresponding  to  coq  =  0.  This  is  the  so-called  zero- 
frequency  translational  mode  whose  existence  is  a  general  feature 
of  quantum-field  models  with  non-trivial  spatial  solutions  and  is 
a  consequence  of  the  translational  invariance  [26].  As  discussed 
above,  the  static  polaronic  solutions  are  degenerate  with  respect 
to  the  position  of  the  polaron  center.  An  infinitesimal  displacement 
along  x  corresponds  to  the  spatial  derivative  of  the  static  solutions 


5912 


Yu.N.  Gartstein,  G.L  Ussery  /  Physics  Letters  A  372  (2008)  5909-5914 


where  the  first  terms  correspond  to  the  shapes  of  the  static  so¬ 
lutions  that  are  centered  around  £(t).  The  set  of  time-dependent 
coefficients  an(t)  (n  ^  1)  and  the  centroid  coordinate  $(t)  then 
constitute  complete  coordinates  of  the  system  and  can  be  conve¬ 
niently  used  to  describe  its  dynamics. 

Using  Eqs.  (24)  in  Eq.  (14)  for  the  potential  evolution,  one  con¬ 
verts  the  latter  into 


-I 


0'  +  '^2an<t>'n  +  7^, 

n>  1  -* 


■  ,  a n 

On  4 - 

Tn 


cj>n  =  F(X,  f). 


where  the  external  perturbation  part  is 

d  <pi(x,t)  Mx,t) 


(25) 


(26) 


Fig.  2.  Illustration  of  local  relaxation  modes  (tubular  geometry  with  as/R  =  10): 
(a)  Relaxation  times  for  the  first  n  =  0  to  8  eigenmodes;  (b)  Shapes  of  the  spatial 
density  variations  pn(x)  for  the  first  3  modes  as  indicated  by  values  of  the  index  n 
next  to  thin  lines.  These  lines  are  arbitrarily  scaled  here  to  be  better  displayed.  The 
thick  line  corresponds  to  the  scaled  static  polaron  density  aBP(x)/q. 

over  x  (denoted  below  with  symbol  ');  one  indeed  easily  verifies 
that 

PoCO  =  y~1,2p'(x),  <t>o(x )  =  y_1/20'(X)  (22) 

satisfy  Eqs.  (18)  with  Xq  =  1.  Here  the  normalization  factor  involves 

Y  =  J  dx\p'(x)4>\x)\.  (23) 

As  an  example,  Fig.  2(b)  shows  spatial  patterns  corresponding  to 
the  first  3  eigenmodes  in  comparison  with  the  static  polaron  shape 
in  terms  of  the  charge  density.  Of  course,  all  density  variations  sat¬ 
isfy  /  dxpn(x )  =  0.  Given  the  numerical  nature  of  the  solution,  we 
have  confirmed  that  the  translational  mode  (n  =  0)  precisely  fol¬ 
lows  the  spatial  derivative  of  the  static  polaron  shape.  The  next 
mode  (n  =  1)  can  be  qualitatively  thought  of  as  corresponding  to 
the  variation  of  the  polaron  width.  The  convergence  pattern  in 
panel  (a)  and  the  spatial  variations  displayed  in  panel  (b)  of  Fig.  2 
are  reminiscent  of  those  discussed  in  connection  with  Id  optical 
polarons  [23-25], 

As  is  well  known  [26,27],  in  the  expansions  of  the  arbitrary 
spatial  patterns  over  the  complete  set  of  eigenmodes,  the  zero- 
frequency  mode  should  be  excluded  in  favor  of  the  collective  coor¬ 
dinate  £(t)  of  the  polaron  center  position  (polaron  centroid).  Such 
expansions  for  the  potential  and  density  variations  then  take  the 
form  of 

</>(*,  t)  =  4>(x  -  f  (t))  +  <*n(t)<t>n(x  -  £(t)),  (24a) 

n^l 

P(x,  t)  =  p(x-  £(t>)  +  J2an(t)Pn{x  -  (24b) 

n>l 


As  in  Eqs.  (24),  all  potential  expansion  functions  in  Eq.  (25)  have 
the  same  argument  (x  —  £(t)). 

4.  Equations  of  motion  and  centroid  dynamics 

One  derives  the  equations  of  motion  for  the  set  of  coordinates 
[|(t),  on(t)}  by  projecting  Eq.  (25)  with  each  of  p„(x  — £(t))  ( n  ^  0) 
and  utilizing  the  orthonormality  relationships  (19).  Now  note  that 
we  are  interested  in  and  have  been  developing  our  framework  to 
study  the  lowest-order  effects  of  small  perturbations.  Correspond¬ 
ingly,  the  system  responses  to  such  perturbations,  the  centroid 
velocity  |(t)  and  shape  deformation  coefficients  an(t),  are  small. 
Therefore,  the  second-order  term  in  Eq.  (25)  (containing  products 
of  §  and  an )  can  safely  be  dropped.  The  result  is  then  a  set  of  fully 
decoupled  equations  of  motion: 


yi  =  f,  (27) 

Gn  +  Gn/^n  = /n>  Tl  ^  1,  (28) 

where 

f($,t)  =  f  dxp'(x)F(x  +  $,r)  (29) 

and 

f„Q,t)  =  -  J  dxpn(x)F(x  +  $,t). 

Eqs.  (27)  and  (28)  evidently  allow  one  to  easily  study  various  dy¬ 
namic  responses.  Here  we  will  concentrate  on  the  centroid  dynam¬ 
ics. 

The  external  (perturbation)  potential  <pi(x,  t )  can  be  represented 
as  a  sum  of  two  contributions: 

0i(X,  t)  =  0a(x,  t)  +  <pf(x,t), 

where  4>a(x,t)  is  due  to  the  applied  field  while  4>f(x,t)  is  due 
to  the  “bare”  fluctuations  of  the  dielectric  polarization  (here,  of 
course,  only  its  slow,  orientational,  component  is  relevant).  We  will 
correspondingly  use  subscript  indices  a  and  /. 

Let  us  consider  first  the  motion  of  the  polaron  under  the  action 
of  a  constant  applied  potential  4>a(x)  that  can  slowly  (on  the  scale 
of  the  polaron  size)  change  in  space.  From  Eqs.  (26)  and  (29)  it 
immediately  follows  that 

r  rt\  f  A  r  +  f  j  ,9</’a(x  +  £) 

/a(f)  =  J  dxp(x) - - - =-J  dxp(x) — — - 

ItstJ/ *„(»)=—,  (so 

where  E  is  the  electric  field  acting  on  the  polaron. 

On  the  other  hand,  the  fluctuating  contribution  //(£,t)  is  a 
random  process  whose  correlations  have  to  be  determined  from 
the  correlations  of  4>f(x,t)  subsequently  using  definitions  (26)  and 
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(29).  The  bare  classical  ( I<bT  ($>h/ r)  fluctuations  are  easily  found 
by  using  the  fluctuation-dissipation  theorem  for  the  specified  Id 
geometry,  the  result  is 

(<pf(x,  t)cpf(x',  t'))  =  a0kBTG(x  -  x')e_|t"t'l/r.  (31) 

With  Eq.  (26),  this  translates  exactly  into 

(Ff(x,  t)Ff(xr,  t'))  =  ^^G(x  -  x')S(t  -  t').  (32) 

Given  the  time-uncorrelated  character  of  Eq.  (32),  the  relevant  cor¬ 
relations  of  f f  are  those  for  the  same  centroid  coordinate: 

(//«.  0//($,  t'))  =  ^(t  -  f).  (33) 


To  derive  Eq.  (33)  from  Eqs.  (29)  and  (32),  we  have  also  used 
Eq.  (15)  in  the  following: 


J  dxdx'  p’ (x)p’ (x!)G(x  — x!) 


p’(x)<p’(x) 

0/0 


Y_ 

ot0 


Introducing  the  random  velocities  rj(t)  =  //(t)/y,  the  equation 
of  motion  (27)  for  the  polaron  centroid  under  the  action  of  a  con¬ 
stant  field  can  be  conveniently  rewritten  as 


|  =  pq£  +  q(t ),  (34) 

where  the  mobility 

M  =  1/ytr  (35) 


is  defined  with  respect  to  the  dynamical  force  q£  while  the  ran¬ 
dom  process  q(t),  as  per  Eq.  (33),  satisfies 

{r,(t)r)(t'))  =  2D8(.t-t')  (36) 

with  the  diffusion  coefficient 


Fig.  3.  The  polaron  mobility  in  the  constant  applied  field  as  a  function  of  the  con¬ 
finement  parameter  a^/R,  here  the  mobility  is  measured  in  units  of  fio  =  aB/TLRy 
which  itself  can  strongly  depend  on  system  parameters. 

For  the  overall  dependence  of  the  polaron  mobility  on  system  pa¬ 
rameters,  one  should  recall  that  the  reference  value  of 

pt0  =  a2B/rLRy  (39) 

is  itself  strongly  parameter-dependent  as  per  Eqs.  (1)  and  (4). 

Using  Eqs.  (38)  and  (39),  one  can  establish  how  the  mobility 
scales  when  some  of  system  parameters  vary.  The  effect  of  the 
radius  R  on  the  mobility  is  already  explicit  in  Eq.  (38).  For  the 
dependence  on  the  effective  mass  m,  one  finds  an  approximate 
scaling  law 

/zocm-1'6.  (40) 

The  dielectric  properties  of  the  solvent  are  also  important  here:  if 
one  were  to  separate  the  effect  of  Tl  on  pt  the  remaining  factor  for 
the  mobility  would  feature  the  dependence  on  e*  as 


D  =  kBT/y  r.  (37) 

Eq.  (34)  is  the  standard  overdamped  Langevin  equation  [12]  for  a 
point  particle  as  consistent  with  our  assumption  of  the  field  having 
practically  no  spatial  variation  on  the  scale  of  the  polaron  size.  We 
emphasize  that  ingredients  of  Eq.  (34)  have  been  derived  from  the 
underlying  microscopic  dynamics.  Our  mobility  (35)  and  diffusion 
coefficient  (37)  manifestly  obey  the  Einstein  relation 

D  =  fiksT , 

thus  ensuring  that  the  macroscopic  dynamics  of  the  polaron  in  the 
steady  state  leads  to  the  Boltzmann  distribution.  We  note  that  ex¬ 
pression  (35)  is  similar  in  form  to  the  one  obtained  from  energy 
conservation  considerations  by  Basko  and  Conwell  [15,16]  but  with 
the  important  quantitative  distinction  that  our  r  =  tj.  is  the  lon¬ 
gitudinal  relaxation  time  rather  than  to  they  used  (the  difference 
that  can  easily  exceed  an  order  of  magnitude). 

Our  discussion  of  the  constant  applied  field  is  straightforwardly 
generalized  to  time-dependent  fields.  As  Eq.  (26)  suggests,  for 
fields  varying  as  £(t)  =  £q  exp(— icot),  the  first  term  in  Eq.  (34) 
would  get  modified,  with  1/r  being  replaced  by  —ico  +  1/r.  In 
addition,  time-dependent  fields  would  be  exciting  polaron  shape 
deformations  (28)  as  we  will  discuss  elsewhere. 

The  calculation  of  the  mobility  (35),  within  the  studied  model, 
is  thus  reduced  to  the  evaluation  of  Eq.  (23)  based  on  the  ground- 
state  of  the  polaron.  Fig.  3  displays  results  of  such  a  calculation  for 
the  tubular  interaction  (13)  in  the  scaled  form  that  separates  the 
specific  dependence  of  the  mobility  on  the  confinement  parame¬ 
ter  aB/R.  Within  the  shown  region  of  its  variation,  the  behavior 
exhibited  in  Fig.  3  may  be  approximated  by  a  power-law: 

(M/Mo)  (aB/R)~^  4.  (38) 


zL  ■  pi  cx  (e*)2'6.  (41) 

With  typical  es  3>  €oo,  €*  ~  To  appreciate  the  effect  on  the 
mobility  itself,  however,  one  is  reminded  that  the  longitudinal  time 
r l  (4)  is  also  proportional  to  €oo-  Eqs.  (40)  and  (41)  illustrate  a 
quite  understandable  general  trend:  the  more  strongly  bound  the 
polaron  is,  then  the  lower  its  mobility  would  be. 

5.  Discussion 

One-dimensional  semiconductor  nanostructures  in  immediate 
contact  with  polar  liquids  is  an  interesting  class  of  systems  of 
relevance  to  many  applications,  particularly  those  involving  re¬ 
dox  reactions  [4,28  J.  Using  simplified  theoretical  models,  we  have 
demonstrated  [6,7]  that  the  solvation  of  excess  charge  carriers  due 
to  the  solvent  polarization  can  cause  carriers  on  Id  SCs  to  self- 
localize.  A  combination  of  several  factors,  small  intrinsic  effective 
masses  of  the  carriers,  enhanced  Coulomb  effects  in  Id  (small- 
diameter  structures)  and  the  sluggish  nature  of  the  orientational 
solvent  polarization,  creates  conditions  in  these  systems  that  may 
be  favorable  for  the  resulting  formation  of  Id  continuum  adiabatic 
polarons.  The  physical  properties  of  the  polarons  are  quite  differ¬ 
ent  from  those  of  the  band  states.  The  energetic  significance  of 
the  polaronic  effect  is  emphasized  by  making  comparisons  with 
much  more  studied  Wannier-Mott  excitons  in  Id  SCs.  When  com¬ 
pared  theoretically  on  an  equal  footing,  we  have  found  [6,7]  that 
the  polaron  binding  energy,  E^01,  can  be  a  substantial  fraction, 
roughly  one-third  for  charge-conjugation-symmetric  SCs,  of  the  ex- 
citon  binding  energy,  E“c,  in  the  media  with  comparable 

Let  us  take  a  widely  known  example  of  single-wall  carbon  nan¬ 
otubes  (SWCNTs).  The  importance  of  the  excitonic  effects  in  the 
optics  of  semiconducting  SWCNTs  is  well  established  now,  with 
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binding  energies  Ejjxc  experimentally  measured  in  some  tubes  to 
be  in  the  range  of  0.4-0. 6  eV  [29-31],  That  would  suggest  that 
polaron  binding  energies  can  reach  values  of  0.1  eV  and  more, 
as  it  also  follows  from  numerical  estimates,  reaffirming  the  po¬ 
tential  importance  of  polarons.  It  would  be  pertinent  to  note  that 
redox  chemistry  of  CNTs  has  been  deemed  an  “emerging  field  of 
nanoscience”  [32]  and  solvatochromic  effects  in  CNTs  are  being 
intensely  researched  [33];  interestingly,  even  the  first  mapping  of 
luminescence  versus  absorption  spectra  of  individual  SWCNTs  has 
been  achieved  in  aqueous  suspensions  [34,35]. 

In  an  effort  to  establish  experimentally  testable  signatures 
of  polaron  formation,  in  this  Letter  we  have  studied  the  low- 
frequency  dynamics  of  Id  large  adiabatic  polarons.  We  have  shown 
how  the  combined  microscopic  dynamics  of  the  electronic  charge 
density  on  the  Id  SC  and  the  solvent  results  in  macroscopic 
Langevin  dynamics  of  the  polaron  centroid.  The  mobility  (35)  and 
diffusion  coefficient  (37)  are  thus  derivable  from  the  stationary 
polaron  parameters  and  the  solvent  dynamic  relaxation.  We  also 
demonstrated  that  new  local  dielectric  relaxation  modes  develop 
in  the  presence  of  the  polaron.  These  modes  involve  polaron  shape 
deformations  and  may  play  a  role  in  time-dependent  responses 
and  in  polaron  collision  events. 

It  is  instructive  to  make  some  numerical  estimates  based  on 
the  obtained  results.  If,  for  instance,  one  chooses  a  set  of  reason¬ 
able  representative  numerical  parameters:  in  =  0.05me  ( me  being 
the  free  electron  mass),  e*  =  3  and  Tl  =  1  ps,  then,  by  Eq.  (1), 
the  resulting  ob  —  32  A,  Ry  ~  76  meV,  and,  by  Eq.  (39),  q  ■  iJt o  — 
1.3  cm2  V-1  s-1.  Using  the  data  of  Fig.  3  for  as/R  =  6  as  an  ex¬ 
ample  of  a  small-diameter  structure,  this  value  of  /xo  would  then 
translate  into  q  •  /x  ~  0.3  cm2  V-1  s-1.  At  T  =  300  I<  this  yields  the 
value  of  the  diffusion  coefficient  D  ~  8  x  10-3  cm2/s.  Of  course, 
with  a  wide  range  of  variability  of  the  relevant  system  parameters 
(suffice  to  mention  values  of  r r),  these  tentatively  estimated  values 
can  change  substantially  for  various  systems.  Still,  these  numbers 
strongly  indicate  that  the  mobility  of  the  solvated  carriers  can  be 
orders  of  magnitudes  smaller  than  the  intrinsic  carrier  mobilities 
of  103-105  cm2  V-1  s-1  discussed  for  SC  wires  and  tubes  [1,2], 

We  conclude  by  reiterating  that  the  illustrative  calculations  pre¬ 
sented  in  this  Letter  are  based  on  simplified  generic  models  whose 
main  role  is  to  elucidate  the  essential  physics.  While  we  do  not 
expect  substantial  changes  in  the  results  valid  for  small-diameter 
nanostructures  (the  strong-confinement  regime,  when  most  dielec¬ 
tric  effects  are  due  to  the  surrounding  medium  [6,7,20]),  more 
accurate  calculations  for  specific  systems,  including  a  finite  size  of 
dielectric  cavities,  SCs  band  structure  and  polarizability,  may  be  re¬ 
quired  for  detailed  comparison  with  prospective  experimental  data. 
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When  an  excess  charge  carrier  is  added  to  a  semiconducting  polymer  chain,  it  is  well  known  that  the  carrier 
may  self-trap  into  a  polaronic  state  accompanied  by  a  bond-length  adjustment  pattern.  A  different  mechanism 
of  self-localization  is  the  solvation  of  charge  carriers  expected  to  take  place  when  the  polymer  chain  is 
immersed  in  polar  media  such  as  common  solvents.  We  use  density-functional  computations  in  conjunction 
with  the  polarizable  continuum  model  to  demonstrate  solvation-induced  self-consistent  charge  localization  into 
large-radius  one-dimensional  (ID)  polarons  on  long  C^H2  carbon  chains  with  the  polyynic  structure.  Within 
the  framework  used,  the  solvation  results  in  a  much  more  pronounced  charge  localization.  We  believe  this 
mechanism  of  polaron  formation  to  be  of  relevance  for  various  ID  semiconductors  in  polar  environments. 
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A  variety  of  one-dimensional  (ID)  wide-band  semicon¬ 
ductor  (SC)  structures — conjugated  polymers,  nanowires, 
and  nanotubes — attract  a  great  deal  of  attention.  The  nature 
and  properties  of  excess  charge  carriers  on  these  structures 
are  fundamental  for  various  applications.  For  our  discussion, 
one  distinguishes  between  nearly  free  band  states  of  excess 
carriers  and  self-localized  polaronic1  states.  Self-trapping 
and  the  formation  of  ID  polarons  can  occur  owing  to  the 
(strong)  interaction  of  the  electronic  subsystem  with  another 
subsystem  such  as,  e.g.,  displacements  of  the  underlying 
atomic  lattice,  which  is  the  mechanism  that  has  been  exten¬ 
sively  explored  for  ID  electron-phonon  systems.  Here  we  are 
concerned  with  a  different,  much  less  studied  implementa¬ 
tion  of  the  polaronic  effect  that  should  take  place  for  ID  SCs 
immersed  in  three-dimensional  (3D)  polar  media,  the  situa¬ 
tion  particularly  common  for  applications  involving  funda¬ 
mental  redox  processes  in  polar  solvents.  In  this  case  of  what 
could  be  called  charge  carrier  solvation,  the  long-range  Cou¬ 
lomb  interaction  is  expected2-4  to  lead  to  the  formation  of  a 
localized  electronic  state  on  a  SC  structure  surrounded  by  a 
self-consistent  pattern  of  the  sluggish  (orientational)  polar¬ 
ization  of  the  solvent.  The  main  goal  that  we  pursue  and 
achieve  in  this  paper  is  to  substantiate  the  validity  of  this 
picture  at  the  level  of  ab  initio  computations,  that  is,  with 
realistic  many-electron  effects  taken  into  account.  To  our 
knowledge,  this  is  a  first  ab  initio  demonstration  of  the 
solvation-induced  self-consistent  localization  of  the  excess 
charge  resulting  in  large-radius  ID  polarons. 

The  type  of  systems  we  consider  is  an  interesting  model 
realization  in  the  general  context  of  quantum  particles  inter¬ 
acting  with  dissipative  environments.5,6  The  confinement  of 
the  electron  motion  to  ID  and  its  structural  separation  from 
the  3D  polarizable  medium  clearly  distinguishes  such  sys¬ 
tems  from  the  well-known  3D  cases  of  polarons  in  polar  SCs 
(Refs.  7-9)  and  solvated  electrons  in  polar  liquids.9,10  A  com¬ 
bination  of  several  factors  here  creates  conditions  that  may 
be  especially  favorable  for  the  formation  of  ID  continuum 
adiabatic  polarons;  these  are  small  effective  band  masses  of 
the  carriers  (e.g.,  —0.1  of  free-electron  mass),  enhanced 
Coulomb  effects  in  ID,11  and  the  slow  response  of  the  sol¬ 
vent  polarization. 


The  self-trapping  of  an  excess  charge  carrier  (an  electron 
or  a  hole)  is  easily  described  in  a  single-particle  picture 
within  a  standard  ID  continuum  adiabatic  framework.  It  cor¬ 
responds  to  the  localized  ground-state  solution  ip{x)  of  the 
nonlinear  Schrodinger  equation: 

h 2  cPiMx)  [  ,  , 

- - dx'V(x-x')\ifix')\-i/^x)  =  Eifr{x), 

2m  ox  J 

where  x  is  the  coordinate  along  the  structure  axis,  m  the 
intrinsic  effective  mass  of  the  carrier,  and  V(x)  the  effective 
self-interaction  mediated  by  another  subsystem.  In  the  case 
of  a  short-range  electron-phonon  mediation,  the  self¬ 
interaction  can  be  taken  local,  F(x)=g<5(.r),  leading  then  to 
the  exact  result  if{x)  ^  sech(gmx 1 2h2)  for  a  continuum  ID 
polaron  widely  known  after  the  pioneering  contributions  of 
Rashba12  and  Holstein.13  In  the  case  of  the  long-range  polar¬ 
ization  interaction,  the  effective  V(x)  behaves  as  I  /  |v|  at 
large  distances,  while  the  short-range  behavior  depends  on 
specific  system  details  such  as  the  actual  transverse  charge- 
density  distribution  and  the  geometry  of  the  dielectric 
screening.  Numerical  studies  of  this  case3,4  indicate  that  the 
binding  energy  of  the  resulting  polarons  could  reach  a  sub¬ 
stantial  fraction,  roughly  one  third,  of  the  binding  energy  of 
the  corresponding  Wannier-Mott  excitons  (primary  optical 
excitations  in  many  ID  SCs). 

While  illuminating  the  basic  physics  of  the  effect,  the 
single-particle  description  has  the  drawback  of  not  explicitly 
including  valence-band  electrons,  whose  reorganization  may 
substantially  affect  the  outcomes.  Such  a  reorganization  can 
be  important  even  in  noninteracting  electron  models — a  nice 
illustration  was  given  in  Ref.  14  that  analyzed  how  polarons 
of  a  two-band  Peierls  dielectric  model  evolve  into  single¬ 
particle  Holstein  polarons  in  the  limit  of  the  “frozen  valence 
band”  approximation.  With  realistic  Coulomb  interactions  in 
place,  the  role  of  valence  electrons  in  the  formation  of  the 
relevant  self-consistent  potentials  would  only  increase.  As 
applied  to  electron-phonon  polarons  in  conjugated  polymers 
(CPs),  various  ab  initio  computational  schemes  have  been 
found  useful  to  clarify  the  role  of  electron-electron  interac¬ 
tions  on  such  polarons  and  their  optical  properties  (we  refer 
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the  reader  to  a  recent  account15  for  a  discussion  and  multiple 
references).  In  what  follows  we  use  the  GAUSSIAN  03  imple¬ 
mented  density-functional  theory  (DFT)  ab  initio  calcula¬ 
tions  in  conjunction  with  the  polarizable  continuum 
model1617  (PCM)  to  study  the  polaron  formation  in  conju¬ 
gated  linear  systems  as  mediated  by  the  polarization  interac¬ 
tion  with  a  surrounding  polar  solvent. 

Since  the  computational  demand  increases  substantially  in 
DFT-PCM  calculations,  we  have  selected  structurally  simple 
systems  of  hydrogen-terminated  polyynic  linear  carbon 
chains  CVH2  (even  N  between  20  and  100)  for  our  demon¬ 
stration.  A  polyynic  structure  exhibits  in  its  ground  state  an 
alternating  pattern  of  triple  C  =  C  and  single  C-C  bonds  and 
is  semiconducting.  It  should  be  noted  that  while  playing  the 
role  of  a  prototypical  example  in  our  study,  polyynic  chains 
continue  to  be  the  subject  of  much  attention  in  their  own 
right.18-22  Interestingly,  they  were  predicted23  to  possess  a 
rich  family  of  electron-lattice  self-localized  excitations  be¬ 
cause  of  the  extra  degeneracy  of  molecular-orbital  (MO)  lev¬ 
els. 

The  results  presented  in  this  paper  have  been  derived  by 
employing  DFT  computations  with  the  B3LYP  hybrid 
exchange-correlation  functional  known  to  include  local  and 
nonlocal  correlation  effects.  This  functional  is  commonly 
used  in  recent  studies  of  polyynes. 18-21  The  6-311  + 
+  G{d,p )  all  electron  basis  set  was  chosen  due  to  its  richness 
and  ability  to  represent  charged  systems.  We  note  that  our  “in 
vacuum”  calculations  have  been  verified  to  compare  well 
against  previously  published  ab  initio  results  on  neutral19  and 
charged20  polyynic  systems  both  in  terms  of  energetics  and 
optimized  bond  lengths.  The  DFT-PCM  (“in  solvent”)  calcu¬ 
lations  used  water  as  a  polar  solvent  with  its  default  param¬ 
eters  in  GAUSSIAN  03. 

A  pertinent  quantity  for  us  is  the  equilibrium  spatial  dis¬ 
tribution  of  the  excess  charge  over  the  ID  SC,  which  is 
naturally  related  to  atomic  charges  in  outputs  of  ab  initio 
calculations.  Importantly,  these  charges  represent  the  effect 
due  to  all  electrons  in  the  system.  It  is  well  known17  that 
calculations  of  the  atomic  charges  from  many-electron  wave 
functions  are  dependent  on  the  basis  set  transformations  and 
sometimes  lead  to  spurious  results.  Two  procedures,  due  to 
Mulliken  and  Lowdin,  are  used  particularly  frequently  in 
such  calculations.17  We  exercised  both  and  established  that 
while  they  may  lead  to  different  raw  atomic  charges  for  in¬ 
dividual  cases,  the  results  become  remarkably  close  when 
calculated  for  unit-cell-based  charge  differences  between 
charged  and  neutral  chains.  The  unit  cells  are  defined  here  to 
consist  of  pairs  of  neighboring  carbons,  with  the  chain  end 
cells  including  the  hydrogen  atomic  charges.  It  is  these  stable 
charge  differences — that  is,  excess  charge — per  cell  that  are 
displayed  in  Figs.  1,  2,  and  4(a)  using  Lowdin  numbers.  We 
studied  both  anionic  (an  extra  electron)  and  cationic  (an  extra 
hole)  chains.  Their  self-localization  behavior  has  been  found 
very  similar.  Accordingly  we  restrict  our  discussion  here  to 
the  anionic  cases,  with  the  excess  charge  in  the  figures  mea¬ 
sured  in  units  of  (-e),  where  e  is  the  magnitude  of  the  fun¬ 
damental  charge. 

To  be  able  to  clearly  separate  the  effect  of  the  solvation  on 
excess  charge  localization  from  the  effect  of  atomic  displace¬ 
ments,  we  start  by  exploring  systems  with  rigid  geometries, 


FIG.  1.  Excess  charge-density  spatial  distribution  on  charged 
chains  with  N=  20,  40,  60,  80,  and  100 — top  to  bottom 
curves,  respectively,  in  their  central  parts.  All  chains  feature  the 
same  pattern  of  fixed  bond  lengths  corresponding  to  RG1  geometry 
described  in  text.  Panel  (a)  displays  results  for  chains  in  vacuum, 
panel  (b)  for  chains  in  the  solvent  environment. 

that  is,  with  fixed  prescribed  atomic  positions.  Our  first  ge¬ 
ometry,  referred  to  as  RG1,  features  the  following  fixed 
lengths:  1.235  A  for  the  triple  bonds,  1.326  A  for  the  single, 
and  1.063  A  for  the  C-H  bonds.  These  lengths  have  been 
retrieved  from  our  complete  optimization  of  the  neutral  N 
=  80  chain  in  vacuum  and  are  therefore  expected  to  be  rela¬ 
tively  close  to  optimal  values.  We  now  use  this  geometry  to 
perform  DFT  calculations  for  a  set  of  CVH2  and  C  vHj 
samples  with  various  N  to  find  the  excess  charge-density 
distribution  both  in  vacuum  and  in  the  solvent  environment. 
The  derived  results  are  compared  in  Fig.  1  whose  (a)  and  (b) 
panels  display  clearly  distinct  trends.  Barring  the  end  effects. 


FIG.  2.  Excess  charge-density  spatial  distribution  on  charged 
chains,  (a)  N=  80.  The  evolution  of  the  charge  distribution 
upon  stretching  the  single  bonds  by  0.1  A  successive  increments 
from  1.326  to  1.726  A — bottom  to  top  in  a  group  of  five  clearly 
distinct  curves  shown  by  different  connected  symbols  and  denoted 
“in  solvent.”  The  same — but  unconnected — data  symbols  are  used 
to  show  the  results  for  chains  in  vacuum;  those  distributions  largely 
overlap,  (b)  Comparison  of  distributions  for  chains  in  the  solvent 
with  A=40,  60,  80  (different  data  symbols),  and  100  (solid  line)  for 
the  geometry  RG2  featuring  the  longest  single  bonds. 
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vacuum  solvent  vacuum  solvent 


FIG.  3.  Molecular-orbital  energy  levels  for  neutral  (0)  and 
charged  (-1)  C10oH2  chains  with  rigid  RG1  [(a)-(d)]  and  RG2  [(e)- 
(h)]  geometries  both  in  vacuum  and  in  the  solvent.  The  arrows 
indicate  positions  of  HOMO  levels. 

vacuum  results  in  panel  (a)  are  reminiscent  of  the  “particle  in 
a  box”  behavior:  as  the  chain  length  increases,  the  excess 
charge  is  distributed  more  and  more  uniformly  over  the 
whole  chain.  In  a  sharp  contrast,  for  chains  in  the  solvent, 
panel  (b),  the  excess  charge  exhibits  much  more  localized 
distributions  around  central  parts  of  the  chains.  As  the  chain 
length  increases,  those  distributions  show  a  tendency  toward 
convergence,  albeit  not  completely  achieved  within  the  range 
of  lengths  studied. 

To  further  the  understanding  of  the  observed  behavior,  we 
now  attempt  to  decrease  the  spatial  extent  of  charge  localiza¬ 
tion.  In  single-particle  models,  the  size  of  the  polaron  is  de¬ 
termined  by  the  balance  of  the  gain  in  the  potential  energy  of 
the  localized  carrier  and  the  loss  in  its  kinetic  energy.  De¬ 
creasing  the  latter  leads  to  a  diminished  delocalization  pro¬ 
pensity  and  is  expected  to  shorten  the  polaron.  This  should 
be  achievable  with  an  increased  intrinsic  effective  mass,  or 
with  a  narrower  electron  bandwidth.  To  imitate  the  effect,  we 
chose  to  artificially  stretch  all  single  bonds  in  our  system, 
thereby  making  C  =  C  “dimers”  more  and  more  separated 
from  each  other.  Figure  2(a)  illustrates  a  dramatic  difference 
in  the  response  of  the  excess  charge  distribution  to  succes¬ 
sive  single-bond  stretchings  for  chains  in  vacuum  and  for 
chains  in  solvent.  Whereas  charge  density  in  vacuum  is 
hardly  affected  in  this  process,  preserving  nearly  uniform 
distributions  over  the  whole  chain,  for  chains  in  solvent,  each 
successive  stretching  step  results  in  smaller  and  smaller  lo¬ 
calization  regions.  We  refer  to  the  fixed  geometry  with  the 
longest-stretched  single  bonds  of  1.726  A  as  RG2.  As  the 
localization  size  in  this  geometry  appears  to  be  well  within 
the  chain  length,  we  compare  the  excess  charge  distributions 
for  several  chain  lengths  in  Fig.  2(b).  The  distributions  for 
N=60,  80,  and  100  practically  coincide  confirming  that  they 
correspond  to  a  fully  converged  self-consistent  pattern  of  ex¬ 
cess  charge  localization  achieved  entirely  due  to  the  interac¬ 
tion  with  the  surrounding  solvent. 

A  comparison  of  the  corresponding  molecular-orbital 
energy-level  structures  is  available  in  Fig.  3.  One  immedi¬ 
ately  notices  a  much  wider  (but  inconsequential  for  the  effect 
we  consider)  band  gap  in  the  RG2  geometry.  The  narrowing 


FIG.  4.  Comparison  of  the  solvation  and  bond-length  adjustment 
effects  for  C4oH2.  (a)  Excess  charge  distributions  in  vacuum  (A) 
and  in  the  solvent  (□)  for  RG1  geometry  (dashed  lines)  and  for  the 
fully  optimized  bond-length  patterns  (solid  lines),  (b)  The  corre¬ 
sponding  MO  energy  levels  in  the  solvent  for  (bl)  optimized  neutral 
chain,  (b2)  charged  chain  in  RG1  geometry,  and  (b3)  optimized 
charged  chain.  The  arrows  show  HOMO  levels. 


of  the  bandwidth  is,  however,  relevant  and  easily  detected 
via  a  much  larger  density  of  states  [panels  (e)  vs  (a)  and  (g) 
vs  (c)].  Disregarding  “trivial”  (the  electrostatic  potential  with 
respect  to  infinity)  overall  shifts  of  the  levels  for  the  charged 
chains  in  vacuum,  the  important  difference  with  the  charged 
chains  in  the  solvent  is  that  the  latter  case  exhibits  local 
highest  occupied  molecular  orbital  (HOMO)  levels  clearly 
separated  from  other  states:  compare  panel  (d)  to  (b)  and, 
especially,  panel  (h)  to  (f).  This  is  the  behavior  expected 
from  single-particle  polaron  models.  One,  however,  does  not 
observe  distinctly  separate  unoccupied  local  levels  appearing 
in  such  models  with  long-range  Coulomb  potentials.3  Also,  a 
reorganization  of  valence-band  electron  states  is  apparent  in 
the  band  gap  upon  the  formation  of  the  polaron,  especially 
well  seen  in  geometry  RG2,  panel  (h).  This  is  suggestive  of  a 
more  complex  electronic  structure  of  the  polaron  as  affected 
by  many-electron  interactions. 

To  better  benchmark  the  localization  due  to  the  solvation, 
we  now  turn  to  bond-length  optimization  results  derived  for 
the  /V=40  system.  Figure  4(a)  pictures  modifications  of  the 
excess  charge  spatial  distribution  that  take  place  upon 
changes  from  the  rigid  RG1  geometry  to  fully  optimized 
bond-length  patterns  both  in  vacuum  and  in  the  solvent.  It  is 
evident  that  for  the  charge  distribution,  the  effect  of  the  lat¬ 
tice  relaxation  is  much  less  significant  than  the  effect  of  the 
solvation.  (The  same  conclusion  has  been  drawn  in  our  stud¬ 
ies  of  /Y=80  chains.)  Figure  4(b)  shows  how  local  HOMO 
and  intragap  valence  levels  are  affected  by  the  pure  solvation 
and  by  the  combined  action  of  the  solvation  and  lattice  dis¬ 
placements.  Interestingly,  here  the  effects  due  to  each  of  the 
mechanisms  appear  similar  in  magnitude  despite  a  much 
more  pronounced  effect  of  the  solvation  on  charge-density 
localization.  This  is  again  likely  indicative  of  the  intricacies 
of  many-electron  interactions  and  the  screening  effects.  A 
more  complete  understanding  of  similarities  and  differences 
of  these  mechanisms  calls  for  further  studies  at  various  com¬ 
putational  levels.  The  picture  of  Fig.  4(b)  suggests  that  com- 
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parative  spectroscopy  of  electronic  transitions  to  and  from 
local  HOMO  levels  should  be  a  helpful  experimental  tool. 

The  spatial  extent  of  demonstrated  solvation-induced  ID 
polarons  is  seen  to  be  on  the  order  of  tens  of  angstroms.  We, 
however,  do  not  feel  to  be  able  to  reliably  evaluate  the  po- 
laron  binding  energy  except  to  estimate  it  at  —0.1  eV  based 
on  comparisons  of  lowest  unoccupied  molecular  orbital 
(LUMO)  levels  with  the  computed  electron  affinities.  Our 
consideration  has  been  limited  to  chains  with  idealized 
straight  structures;  various  defects  including  spatial  confor¬ 
mations  of  the  chains  can  lead  to  the  formation  of  bound 
polarons  with  possibly  larger  binding  energies. 

While  providing  a  clear  conceptual  demonstration,  the 
present  ab  initio  framework  may  overestimate  the  effects  of 
the  solvation  on  self-localization  as  it  does  not  take  a  proper 


account  of  the  frequency  dependence  of  the  solvent  dielectric 
function.3'4'7,9  This  issue  could  possibly  be  addressed  within 
time-dependent  DFT-PCM  computational  schemes.24  An¬ 
other  current  development,  on  the  scaling  description  of  sol¬ 
vent  effects,25  may  also  help  in  studies  of  large  polaron  for¬ 
mation.  Given  the  general  character  of  the  solvation-induced 
self-localization  with  the  subsequent  decrease  of  charge  car¬ 
rier  mobility,2’6’26  as  well  as  its  expected  significance  for  fun¬ 
damental  redox  processes  on  various  ID  SC  structures,  fur¬ 
ther  studies  of  this  effect  seem  quite  interesting  and 
warranted. 

We  gratefully  acknowledge  financial  support  from  the 
Collaborative  U.  T.  Dallas-SPRING  Research  and  Nanotech¬ 
nology  Program. 
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When  an  excess  charge  carrier  is  added  to  a  one-dimensional  (ID)  wide-band  semiconductor 
immersed  in  a  polar  solvent,  the  carrier  can  undergo  self-localization  into  a  large-radius  adiabatic 
polaron.  We  explore  the  local  optical  absorption  from  the  ground  state  of  ID  polarons  using  a 
simplified  theoretical  model  for  small-diameter  tubular  structures.  It  is  found  that  about  90%  of  the 
absorption  strength  is  contained  in  the  transition  to  the  second  lowest-energy  localized  electronic 
level  formed  in  the  polarization  potential  well,  with  the  equilibrium  transition  energy  larger  than  the 
binding  energy  of  the  polaron.  Thermal  fluctuations,  however,  can  cause  a  very  substantial — an 
order  of  magnitude  larger  than  the  thermal  energy — broadening  of  the  transition.  The  resulting 
broad  absorption  feature  may  serve  as  a  signature  for  the  optical  detection  of  solvated  charge 
carriers.  ©  2009  American  Institute  of  Physics.  [DOI:  10.1063/1.3046292] 


I.  INTRODUCTION 

One-dimensional  (ID)  wide-band  semiconductor  (SC) 
nanostructures  (e.g.,  nanotubes  and  nanowires)  in  contact 
with  polar  solvents  is  an  interesting  class  of  systems  of  par¬ 
ticular  relevance  to  applications  and  processes  involving  fun- 
damental  redox  reactions.  '  When  an  excess  charge  carrier 
(an  electron  or  a  hole)  is  added  to  such  a  structure,  the  carrier 
can  become  solvated  resulting  in  an  adiabatic  polaron,  a  self- 

consistent  combination  of  a  localized  electronic  state  and  a 

4—8 

dielectric  polarization  pattern  of  the  surrounding  medium. 
The  long-range  Coulomb  mechanism  of  the  polaron  forma¬ 
tion  here  is  analogous  to  the  well-known  three-dimensional 
(3D)  polarons  in  polar  SCs  (Refs.  9-11)  and  solvated  elec- 
trons  in  polar  liquids.  '  Distinct  from  those  cases,  how¬ 
ever,  are  the  confinement  of  the  electron  motion  in  a  ID 
nanostructure  and  its  structural  separation  from  the  3D  po¬ 
larizable  medium. 

The  physical  properties  of  strong-coupling  polarons  are 
quite  different  from  the  nearly  free  band  electrons  frequently 
discussed  in  the  context  of  electronic  transport  in 
nanotubes14' 1:1  and  nanowires.16  We  have  already  emphasized 
the  energetic  significance  of  the  solvation  by  finding5'6  that 
the  binding  energy  of  the  resulting  ID  polarons  could  reach  a 
substantial  fraction,  roughly  one-third,  of  the  binding  energy 
of  Wannier-Mott  excitons,  the  well-known  primary  photoex¬ 
citations  in  many  ID  SCs.  This  may  lead  to  enhanced  charge 
separation.  On  the  other  hand,  the  mobility  of  solvated 
charge  carriers  is  drastically  reduced  due  to  the  dissipative 
drag  of  the  medium.4'8'17  In  this  paper  we  address  qualitative 
theoretical  expectations  for  the  local  optical  absorption  from 
solvation-induced  ID  polarons.  This  kind  of  the  optical  ab¬ 
sorption  has  been  a  powerful  tool  in  optical  detection  of 
electron-lattice  polarons  in  conjugated  polymers  as  well  as  of 
3D  solvated  electrons. 
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While  the  polaronic  effect  due  to  solvation  and  the  fea¬ 
tures  we  discuss  have  a  generic  character,  specific  illustrative 
calculations  in  this  paper  are  done  using  a  simplified  model 
representation  for  small-diameter  tubular  structures.  A 
widely  known  example  of  such  structures  is  semiconducting 
single-wall  carbon  nanotubes  (SWCNTs).  We  note  that  redox 
chemistry  of  CNTs  has  been  deemed  an  “emerging  field  of 
nanoscience”  1  and  solvatochromic  effects  in  CNTs  are  be¬ 
ing  intensely  researched. 

As  discussed  in  more  detail  later.  Fig.  1  displays  calcu¬ 
lated  functional  dependences  for  the  binding  energy,  Eh,  and 
the  spatial  extent,  /,  of  ID  adiabatic  polarons  in  terms  of  the 


aB/R 


FIG.  1.  The  binding  energy,  Eb,  and  the  linear  size,  /,  of  ID  adiabatic 
polarons  as  a  function  of  the  confinement  parameter  aB/R.  The  size  /  is 
defined  here  as  the  FWHM  of  the  localized  electron  density.  Different 
curves  correspond  to  different  dielectric  conditions,  as  explained  more  fully 
in  the  text:  filled  (solid  lines),  hollow  (long-dash),  and  polarizable  (short- 
dash)  nanotube  screening  models. 
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tube  radius  R  and  convenient  scales  of  the  effective  Bohr 
radius  and  Rydberg  energy  for  the  corresponding  3D  Cou¬ 
lomb  problem20 

aB=  e*h2/mq2,  Ry  =  q2/2e*aB.  (1) 

Here  q  is  the  carrier  charge,  m  its  effective  mass,  and  e*  is 
the  effective  dielectric  constant  of  the  uniform  3D  medium. 
These  scales  arise  naturally  in  the  analysis  (e.g.,  Refs.  5  and 
6  and  below  in  Sec.  II)  of  the  polaronic  effect  due  to  the 
long-range  polarization  interaction.  They  result  from  the  bal¬ 
ance  between  the  carrier  delocalization  energy  loss  and  the 
polarization  energy  gain  due  to  localization.  The  length  /  of 
the  polaron  has  been  defined  here  as  the  full  width  at  half 
maximum  (FWHM)  of  the  localized  ID  electron  charge  den¬ 
sity.  In  what  follows  we  refer  to  the  ratio  aBIR  as  the  con¬ 
finement  parameter.  It  is  well  known  that  the  spatial  confine¬ 
ment  generally  results  in  amplification  of  the  Coulomb 
binding,"  and  Fig.  1  illustrates  both  the  magnitude  and  the 
growth  of  Ebl Ry  for  ID  polarons  with  the  increased  confine¬ 
ment  parameter.  For  a  proper  perspective,  one  should 
compare6  those  results  with  the  polaron  binding  energy  in 
higher  dimensions,  where  the  classic  Pekar’s  result  for 
Eb  in  3D  is  only  —0.1  Ry,  while  in  two-dimensional  (2D) 
systems"4  it  increases  to  —0.4  Ry.  Our  variational 
calculations6  for  cylindrical  surfaces  have  shown  that  the 
transition  to  a  ID  polaron  structure  (charge  uniformly  dis¬ 
tributed  over  the  tube  circumference)  takes  place  at  aBl R 
<:  1 ,  this  is  the  regime  of  our  interest  in  this  paper.  At  smaller 
values  of  the  confinement  parameter,  nonuniform  2D  charge 
distributions  can  take  place. 

As  exemplified  in  Fig.  1,  we  present  our  results  in  a 
dimensionless  form  irrespective  of  numerical  values  of  ap¬ 
propriate  scales  (1).  This  way  one  can  allow  for  uncertainties 
of  values  of  individual  system  parameters  and,  more  impor¬ 
tantly,  study  the  trends  of  the  system  behavior  as  dependent 
on  relevant  combinations  and  scaling  with  the  confinement 
parameter.  It  is,  however,  useful  and  interesting  to  make 
some  tentative  estimates  of  representative  numerical  values 
in  specific  cases.  The  effective  dielectric  constant  in  Eq.  (1) 
varies  for  different  solvents  and  can  be  determined  from  the 
well-known9'1 1,13,23  relationship 

1/e*  =  1/Coo  -  1  les, 

typical  ’  static  es  and  high-frequency  e,  constants  satisfy 
es  §>  so  that  e*  —  ex.  Wide-band  SCs  we  are  interested  in 
are  commonly  characterized  by  effective  band  masses  of 
charge  carriers  much  smaller  than  the  free  electron  mass  me 
(e.g.,  —0.1  me).  For  SWCNTs,  in  particular,  band  masses  of 
carriers  generally  depend  both  on  tube  radius  and  chirality; 
see,  e.g.,  Ref.  15  for  a  compilation  of  some  theoretical  data. 
If,  for  instance,  one  were  to  use  the  band  mass  of  0.05 me 
corresponding  to  R—  8  A  (Ref.  15)  for  m,  along  with  a  rep¬ 
resentative  value  of  e*  =  3,  then  Eq.  (1)  would  result  in  aB 
—  32  A  and  Ry— 76  meV.  In  this  particular  example,  then, 
aBIR  —  4  and  Eb~ 0.1  eV.  Larger  values  of  m  or  smaller 
values  of  e*  would  lead  to  increased  magnitudes  of  Eh  by 


virtue  of  larger  values  of  Ry.  The  dependence  of  the  band 
mass  on  radius  R  in  SWCNTs  [roughly  <*l/R  (Ref.  15)], 
however,  would  make  the  confinement  parameter  in  this  es¬ 
timate  nearly  independent  of  the  radius  (see  also  Sec.  IV). 
We  will  be  employing  specific  values  of  aBIR  =  4  and  10  for 
some  plots  below,  which  are  meant  to  serve  mainly  illustra¬ 
tive  purposes. 

Using  the  band  mass  for  m  in  the  above  consideration 
tacitly  assumed  that  the  charge  carrier  is  not  affected  by  in¬ 
teractions  with  vibrational  degrees  of  freedom  of  the  ID  SC. 
Meanwhile,  electron-phonon  (and  exciton-phonon)  interac¬ 
tions  in  ID  systems  are  well-known  drivers  of  the  polaronic 
effect  on  their  own.  This  includes  self-localized  strong¬ 
coupling  polaronic  (solitonic)  states,  e.g.,  in  molecular  crys¬ 
tals  and  conjugated  polymers,  that  have  been  studied  in  a 
great  body  of  work  since  the  pioneering  contributions  of 
Rashba"6  and  Holstein,"7  as  well  as  various  intermediate  re¬ 
gimes  (see,  e.g.,  an  extensive  discussion  in  Ref.  28).  The 
problem  of  a  charge  carrier  strongly  interacting  both  with 
displacements  of  the  underlying  atomic  lattice  and  with  the 
surrounding  solvent  is  interesting  and  would  require  a  corre¬ 
sponding  self-consistent  treatment  which  we  do  not  address 
in  this  paper.  The  systems  of  our  interest  here  exhibit  instead 
a  weak  electron-phonon  coupling  (in  the  parlance  of  the 
theory  of  polarons23)  that  result  only  in  a  relatively  slight 
renormalization  of  the  band  parameters.  Indeed,  first 
calculations  of  electron-lattice  polarons  on  SWCNTs  due  to 
acoustic  vibrations  revealed  minuscule  binding  energies  of 
fractions  of  meV.  More  refined  recent  calculations  included 
the  interaction  with  optical  phonons:  although  the  band  en¬ 
ergy  shift  was  now  found  much  larger,  on  the  order  of  tens  of 
meV,  it  was  at  the  same  time  much  smaller  than  the  corre¬ 
sponding  phonon  energy.  In  accordance  with  this  weak- 
coupling  feature,  the  renormalization  of  band  velocities  in 
calculations  of  Ref.  30  was  found  to  be  just  a  few  percent. 
Also  consistent  with  the  picture  of  weak  electron-phonon 
scattering  in  SWCNTs  are  the  resulting  huge,  on  the  order  of 
104— 105  cm2/V  s,  mobilities  of  charge  carriers.15'30  In  con¬ 
trast,  solvated  charge  carriers  are  expected  to  have  mobilities 

g 

orders  of  magnitudes  smaller  as  follows  from  the  strong 
interaction  with  the  solvent.  The  net  result  of  the  weak 
electron-phonon  coupling  for  our  carrier  solvation  problem  is 
therefore  a  slight,  few  percent,  increase  in  the  effective  mass 
m  from  its  band  values  in  the  rigid  lattice.  Hence,  estimates 
of  solvation-induced  polaron  bindings  of  0. 1-0.2  eV  appear 
as  reasonable  representative  values.  Moreover,  it  should  be 
noted  that  such  estimates  would  be  in  apparent  agreement 
with  experimental  reports'  of  exciton  binding  energies  of 
0.4-0. 6  eV  in  SWCNTs  and  our  earlier  assessments5'6  of  the 
relationship  between  polaron  and  exciton  bindings. 

Figure  2  displays  an  example  of  the  equilibrium  ID  po¬ 
larization  potential  well  resulting  from  the  solvation  of  an 
excess  charge  carrier.  The  corresponding  pattern  of  the  me¬ 
dium  polarization  is  self-consistently  stabilized  by  the  elec¬ 
tric  field  of  a  charge  carrier  occupying  the  lowest-energy 
localized  electronic  level  formed  in  this  well  (the  ground 
state  of  the  polaron).  Along  with  that  level,  also  shown  are 
the  two  next  higher-energy  empty  levels.  The  results  dis¬ 
cussed  below  indicate  that  it  is  the  transition  between  the  two 
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FIG.  2.  The  equilibrium  self-consistent  ID  potential  well  (x  is  a  coordinate 
along  the  tube  axis)  along  with  the  three  lowest-energy  electronic  energy 
levels  calculated  for  aB/R= 4.  The  optical  transition  with  the  largest  oscilla¬ 
tor  strength  is  indicated  by  the  arrow,  the  dashed  lines  showing  the  localized 
electronic  wave  functions  of  the  participating  states. 

lowest-energy  levels  that  has  the  largest  oscillator  strength 
and  dominates  the  optical  absorption  in  the  ground  state.  The 
long-range  behavior  of  the  potential  well  is  Coulombic,  and 
hence  it  contains  many  more  higher-energy  localized  states 
(not  shown  in  Fig.  2)  converging  to  the  onset  of  the  con¬ 
tinuum  spectrum  at  zero  energy  in  the  figure. 

Our  consideration  in  this  paper  assumes  that  the  polaron 
binding  energy  Eb  is  sufficiently  larger  than  the  thermal  en¬ 
ergy  kBT  so  that  the  thermal  occupation  of  higher-energy 
electronic  states  can  be  neglected.  Even  so,  thermal  fluctua¬ 
tions  are  found  to  very  significantly  broaden  the  optical  tran¬ 
sition  in  question,  as  is  also  the  case  for  3D  solvated 
electrons.34  In  the  presence  of  the  polaron,  the  fluctuations 
involve  combined  dynamics  of  the  electronic  charge  density 
and  the  solvent  dielectric  polarization.  A  major  contributor  to 
the  broadening  is  the  resulting  ID  polaron  “breathing”  mode, 
qualitatively  corresponding  to  fluctuations  of  the  polaron 
length.  Without  pursuing  the  exact  calculation  of  the  absorp¬ 
tion  line  shape,  we  will  provide  an  approximate  estimation  of 
the  broadening  due  to  the  breathing  mode  within  the  classical 
Franck-Condon  framework.  '  The  described  broad  absorp¬ 
tion  feature  could  be  used  for  detection  of  the  solvation- 
induced  ID  polarons  and  an  estimate  of  their  binding  energy 
in  prospective  experimental  studies. 


II.  THE  GROUND-STATE  AND  OPTICAL  TRANSITIONS 
IN  EQUILIBRIUM 

In  the  single-particle  ID  continuum  adiabatic  frame¬ 
work,  the  excess  charge  carrier  is  described  by  the  wave 
function  i/Ax) ;  it  responds  to  the  electrostatic  potential  <f>{x) 
due  to  the  surrounding  medium  via  the  corresponding 
Schrodinger  equation: 

h~  cPil/Ax) 

-  - - z~2  +  q<Kx)  0iW  =  .  (2) 

2m  dx 


The  magnitude  of  the  effective  mass  in  here  determines  the 
propensity  of  the  charge  carrier  to  delocalize  along  the  ID 
SC:  the  smaller  the  m  the  larger  the  loss  in  the  kinetic  energy 
is  due  to  given  spatial  localization.  In  the  absence  of  the 
potential  (/Ax) ,  eigenfunctions  of  Eq.  (2)  would  be  just  plane 
waves  i/Ax) a  exp(zfcc)  corresponding  to  kinetic  energies 
E{k)  =  h2k2 /  2m  of  delocalized  charge  carriers.  As  we  dis¬ 
cussed  in  Sec.  I,  a  weak  electron-phonon  coupling  results  in 
a  slight  increase  in  m  with  respect  to  the  band  mass  in  a  rigid 
lattice  structure. 

We  now  proceed  with  the  discussion  of  self-localization. 
In  the  ground  state,  the  charge  carrier  occupies  the  lowest- 
energy  (/  =  0)  state  so  that  the  resulting  ID  electronic  charge 
density  is 

p{x)  =  q\i//0(x)\2 .  (3) 

At  equilibrium ,  the  self-consistency  requires  that  this  charge 
density  stabilizes  the  very  electrostatic  potential  used  in  Eq. 
(2): 

(f>(x)  =  J  dx'G(x-x')p(x'),  (4) 

where  kernel  G  is  the  appropriate  electrostatic  response  func¬ 
tion  assuming  the  translational  invariance  along  the  x-axis. 
Solving  Eqs.  (2)— (4)  together  yields  the  equilibrium  ID  po¬ 
laron  distributions  that  we  will  be  denoting  as  c/>{x)  and  p(x) . 
One  notes  that  the  equations  determining  the  equilibrium 
correspond  to  the  minimum  of  the  total  adiabatic  energy  of 
the  system: 

E^^Eo  +  Ep,  (5) 

consisting  both  of  the  electronic  energy  E0  in  Eq.  (2)  and  the 
energy  stored  in  the  dielectric  polarization  of  the  medium: 

Ef=-J  dxdx’ 4>{x)G~l{x  -  x’)4>{x') , 

where  G~l  is  the  kernel  inverse  to  G  (see  also  Ref.  5).  As  the 
continuum  onset  in  Eq.  (2)  is  set  at  zero  energy,  the  “depth” 
of  this  minimum  is  the  polaron  binding  energy: 

min{En  =  -Eb.  (6) 

The  data  for  Eh  and  equilibrium  p(x)  and  q  (/Ax)  have  been 
used  in  Figs.  1  and  2.  Other  adiabatic  energy  surfaces  Ef' 
=Ej+EP  are  defined  similarly  to  Eq.  (5). 

The  electrostatic  potential  4>{x)  in  Eq.  (2)  is  due  to  only 
the  slow  (orientational)  polarization  of  the  surrounding  me¬ 
dium  and  hence  should  not  include  the  “instantaneous”  self¬ 
interaction  of  the  charge  carrier  with  itself.  A  proper  separa¬ 
tion  of  the  slow  polarization  response  is  a  standard  step  in 
the  polaron  problem.11'13,23’35  The  kernel  G  in  Eq.  (4)  can 
therefore  be  written  as 

G(x)  =  Gs(x)-Gjx),  (7) 

where  indices  s  and  °°  denote  that  the  standard  electrostatic 
potential  problem  (4)  is  solved  with  dielectric  conditions  cor¬ 
responding  respectively  to  full  (static,  s)  screening  or  to 
screening  only  by  “fast”  (°°)  components  of  polarization.  In 
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the  polaron  context,  fast  means  operative  on  time  scales 
shorter  than  hi Eb.  In  the  model  we  consider,  the  continuum 
solvent  is  characterized  by  the  corresponding  dielectric  con¬ 
stants  es  and  e.x.  Another  fast  contributor  to  the  screening  is 
the  polarizability  of  the  ID  SC  nanostructure  itself  related  to 
interband  electronic  transitions. 

In  Fourier  space,  g{k)=  fdxe~‘kxG(x),  Eq.  (7)  can  be  con¬ 
veniently  rewritten  as 


g(k)  =  g0(k) 


1 

eM) 


(8) 


where  the  ^-dependence  of  the  corresponding  dielectric  func¬ 
tions  reflects  a  possible  spatial  dispersion  of  the  screening 
due  to  the  geometry  of  the  system. 

The  bare,  unscreened,  response  for  the  ID  tubular  geom¬ 
etry  (the  “elementary”  charge  distributions  are  uniform  rings 
of  radius  R)  is  given  by5'36 

g0(k)  =  2I0(kR)K0(kR),  (9) 
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FIG.  3.  (a)  The  energies  Et  of  the  three  lowest  electronic  energy  levels  in  the 
ground  state  equilibrium  as  a  function  of  the  confinement  parameter  for 
filled  (solid),  hollow  (long-dash),  and  polarizable  (short-dash)  screening 
models,  (b)  The  ratio  of  the  polaron  binding  energy,  Eh,  to  the  electronic 
transition  energy,  El0,  in  equilibrium  calculated  with  the  same  models. 


where  I(x)  and  K(x)  are  the  modified  Bessel  functions  ap¬ 
pearing  in  electrostatic  problems  with  cylindrical 
symmetry/  The  general  form  of  the  corresponding 
/c-dependent  dielectric  functions  can  be  represented5  as 

e(k)  =  kR[elIl{kR)K0(kR)  +  CiJo^K^kR)]  +  I1(A:), 

where  ex  is  the  dielectric  constant  of  the  dielectric  medium 
inside  the  tube,  e2  is  the  dielectric  constant  of  the  unbounded 
medium  outside  the  tube,  and  Tf(A')  is  the  contribution  arising 
from  the  polarizability  of  the  tubular  surface  itself. 

If  the  latter  contribution  is  neglected  and  the  media  in¬ 
side  and  outside  are  the  same  (we  will  be  calling  this  model 
case  “filled”),  there  would  be  no  spatial  dispersion  in  the 
screening  (uniform  medium)  and  response  (8)  takes  a  simple 
form  of 

g{k)  =  -  g0(k)l  e* .  (10) 

With  this  form,  one  easily  sees  how  the  original  problem 
description  provided  by  Eqs.  (2)-(4)  and  (9)  can  be  con¬ 
verted  into  a  completely  dimensionless  form  employing  all 
lengths  measured  in  units  of  aB  and  all  energies  in  units  of 
Ry,  Eq.  (1).  This  way,  relevant  combinations  of  system  pa¬ 
rameters  are  properly  recognized,  and  the  confinement  pa¬ 
rameter  aBl  R  appears  naturally  in  the  response  function  (9). 
For  another  discussion  of  energy  and  length  scales,  see  also 
Ref.  6  comparing  Wannier-Mott  and  polaron  energy  func¬ 
tionals. 

In  order  to  evaluate  the  magnitude  of  the  effects  arising 
from  different  dielectric  screening  conditions,  we  also  exam¬ 
ine  the  model  cases  called  “hollow,”  where  the  interior  of  the 
tube  does  not  contain  any  medium  (^  =  1),  and  “polarizable,” 
in  which  the  hollow  model  is  augmented  by  the  tube  polar¬ 
izability.  For  the  model  of  the  latter,  we  will  use  the  follow¬ 
ing  approximate  representation  of  the  polarizability  of  semi¬ 
conducting  SWCNTs  calculated  in  Ref.  38: 

6.2(kR)2I0(kR)K0(kR) 

n(‘)=  1  + 1.6(4*)'*  • 


Figure  1  compares  the  results  for  the  ground  state  of  the 
polaron  calculated  with  the  three  dielectric  screening  models, 
in  which  we  used  representative  values  of  e.x  =  3  and  es  =  40 
for  the  solvent  medium.  It  is  evident  from  the  comparison 
that  in  the  parameter  range  we  study,  different  screening 
models  do  not  lead  to  drastically  different  results.  The  varia¬ 
tions  observed  are  quite  understandable  and  are  largely 
caused  by  the  effective  changes  in  the  magnitude  of  the  fast 
screening,  which  is  decreased  by  going  from  the  filled  to 
hollow  model  and  is  increased  in  the  polarizable  model. 
Relatively  small  changes  are  also  observed  in  the  results  for 
individual  electronic  energy  levels  displayed  in  Fig.  3.  Panel 
(a)  of  that  figure  just  shows  the  scaling  of  the  three  lowest 
electronic  energy  levels  E,  (i  =  0, 1 ,2)  from  Eq.  (2)  as  formed 
in  the  equilibrium  self-consistent  potential  well  (see  Fig.  2). 

For  the  optical  absorption  from  the  ground  state,  a 
Franck-Condon  framework  discussion  requires  electronic 
transition  energies 


£,-o  =  £j--£o-  (11) 

The  dominant  dipole  electronic  transition  turns  out  to  be  i 
=  0— +i  =  l,  and  Fig.  3(b)  shows  the  relationship  between  the 
polaron  binding  energy  Eb  and  the  equilibrium  value  of  £j0. 
[To  compare,  the  ratio  of  these  energies  for  3D  Pekar’s  po¬ 
laron  is  about  0.8  (Ref.  23).]  More  detailed  information  on 
this  transition  for  our  ID  polaron  is  displayed  in  Fig.  4;  the 
most  important  conclusion  from  which  is  that  the  transition 
contains  nearly  90%  of  the  total  absorption  strength  across  a 
wide  range  of  the  confinement  parameter.  This  overall  domi¬ 
nance  is  similar  to  the  one  taking  place  for  3D  (Ref.  34)  and 
2D  (Ref.  39)  solvated  electrons. 

Since  Eq.  (2)  has  a  form  of  a  standard  continuum 

Schrodinger  equation,  the  overall  absorption  satisfies  the  op- 
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tical  sum  rule  familiar  from  atomic  systems 
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FIG.  4.  Oscillator  strength  and  transition  energy  for  the  optical  transition 
between  i= 0  and  i=  1  electronic  eigenstates  in  the  equilibrium  potential 
well.  Calculations  done  with  different  screening  models  are  denoted  as  in 
Figs.  1  and  3. 

S/i0=  i. 

i 

where  the  oscillator  strength  of  the  transition  to  the  ith  state 
2  h1 

fio  =  —zrlii/'iix) |  d I dx\ipQ{x))\2 .  (12) 

mEi0 

In  terms  of  the  frequency  co-dependent  oscillator  strength 
density,40  the  sum  rule  per  one  polaron  reads 

f  f(co)dco=  1,  (13) 

do 

where,  for  sharp,  discrete  transitions  (11), 

f(oj)  =  2  f,o <5(  ■ w  -  w,0) ,  oj,0  =  Ei0/h.  (1 4) 

i 

We  will  discuss  in  Sec.  Ill  how  the  thermal  broadening  of 
these  transitions  occurs  resulting  in  a  continuous  /(«). 

As  Eq.  (12)  clearly  indicates,  the  optical  absorption  in 
our  case  is  due  to  the  electric  field  polarized  parallel  to  the 
tube  axis.  The  data  in  Fig.  4  have  been  derived  by  calculating 
matrix  elements  in  Eq.  (12)  between  the  states  in  the  equi¬ 
librium  potential  well  (and  the  sum  rule  verified  in  all  cases). 

III.  THERMAL  BROADENING  OF  THE  OPTICAL 
ABSORPTION 

In  the  Franck-Condon  framework,  the  optical  absorption 
from  the  polaron  is  realized  by  virtue  of  “vertical”  electronic 
transitions  between  the  adiabatic  potential  surfaces  E‘ot, 
meaning  that  the  polarization  state  of  the  environment  does 
not  change  while  the  transition  takes  place.  This  picture  of 
slow  modulating  dynamics  of  the  environment  leads  to  what 
is  called  inhomogeneous  broadening  of  the  absorption  line 
shape  (see  Chapter  5  of  Ref.  35  for  an  extensive  discussion) 
and  is  well  justified  for  our  application  to  the  case  of  the 
orientational  solvent  polarization.  Indeed,  the  parameter 
range  of  our  interest  and  practical  significance  corresponds  to 
the  regime  of  Eb> kBT>hl  tl,  and,  hence. 


tl  >  h(kBTEb)-m, 

which  is  a  condition  of  the  applicability  of  the  semiclassical 

35 

Franck-Condon  picture."  Here  tl  is  the  longitudinal  relax¬ 
ation  time  of  the  solvent  defining  the  relevant  time  scale  for 
the  solvent  orientational  dynamics  and  equal  to13,25,41 

TL  =  (eJes)TD 

for  the  Debye  solvent  described  by  the  co-dependent  dielec¬ 
tric  function 


e{m)  =  eO0+^^.  (15) 

1  -  10)Td 

Typical  solvents  are  characterized  by  a  wide  range  of  rL 

5  95 

ranging  from  fractions  to  tens  of  picoseconds.  ’ 

Thermal  fluctuations  result  in  variations  of  the  potential 
distribution  <f>(x)  in  Eq.  (2)  from  the  equilibrium  pattern  cf>{x) 
discussed  in  the  previous  section,  thereby  modulating  elec¬ 
tronic  wave  functions  and  transition  energies  (11).  Averaging 
over  various  fluctuations  would  thus  lead  to  broadening  of 
the  discrete  transitions  in  Eq.  (14).  In  the  discussed  semiclas¬ 
sical  picture,  the  weighting  of  various  fluctuations  in  the 
ground  state  is  described  by  the  Boltzmann  factor  associated 
with  fluctuations  of  the  total  adiabatic  energy  (5)  from  the 
minimum  equilibrium  value  (6).  Of  course,  fluctuations  in 
the  presence  of  the  polaron  are  not  the  fluctuations  of  the 
solvent  alone  but  rather  combined  fluctuations  of  the  me¬ 
dium  and  localized  electronic  charge  density. 

One  can  get  an  insight  into  the  nature  of  thermal  fluc¬ 
tuations  of  the  polaron  shape  by  using  explicit  results  derived 
in  Ref.  8  for  small-amplitude  fluctuations  in  a  simplified 
model  of  the  charge  carrier  interacting  with  the  Debye  sol¬ 
vent  (15)  with  response  function  (10).  Within  that  model, 
shape  fluctuations  are  governed  by  amplitudes  an(t )  in  the 
expansion  of  the  time-dependent  electric  potential  cf>(x ,  t)  and 
self-localized  carrier  charge  density  p(x,t)  over  the  normal 
dielectric  relaxation  modes: 

c/>{x,t)  =  <£(x-  £(r))  +  2  an(t)(/)n(x-  £(t)),  (16a) 

«>1 


p(x,t)  =  p(x-i;(t))  +  2  an(t)pn(x-  £(f)),  (16b) 

«>1 

where  the  first  terms  in  the  right  hand  side  of  Eq.  (15)  cor¬ 
respond  to  the  equilibrium  static  patterns.  Both  equilibrium 
patterns  and  normal  modes  are  centered  around  arbitrary  po¬ 
laron  centroid  positions  £(f).  The  dynamics  of  £(f)  results  in 
the  diffusion  of  the  polaron  as  a  whole  and  does  not  affect 
the  optical  absorption  line  shape.  The  spatial  patterns  of  the 
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normal  modes,  (</>„(x)  ,p„(x)),  follow  from  the  solutions  of 
the  generalized  eigenvalue  (A.,,)  problem: 


p„(x)  =  -  J  dx' L(x,x')4>n{x') , 

(17a) 

<£«(*)=- TfJ  dx'G0{x-x')pn(x'), 

(17b) 

where  spatial  kernel  L  is  determined  by  the  response  of  the 
electronic  subsystem  (see  Ref.  8  for  more  detail).  Equations 
(16)  yield  shape-modulating  modes  (n>l)  with  growing  ei¬ 
genvalues  A„  >  1  as  well  as  the  zero-frequency  translational 
mode  with  A0=  1,  which  is  excluded  in  Eq.  (15)  in  favor  of 
the  collective  coordinate  £{t). 

For  our  purposes  here,  it  is  convenient  to  choose  the 
normalization  of  the  modes  as 

-  J  dxpm  (x)  4>n  (x)  =  —  Snm ,  (18) 

note  that  coefficients  an{t)  are  then  measured  in  units  of  the 
square  root  of  energy.  The  equations  of  motion  for  so  nor- 


malized  modes  read" 

d„  +  (i„l  Tn  =  7],n 

(19) 

where  mode  relaxation  times  r„  are  given  by 

tlIt„  =  1  -  1/A„ 

(20) 

and  perturbations  rjn  by 

Vniid)  =  ~  K  J  dxp„(x)F(x  +  f,r) . 

(21) 

As  the  eigenvalues  A.,,  grow  with  the  mode  index  n,  the  re¬ 
laxation  times  r„  in  Eq.  (20)  converge  to  the  solvent  longi¬ 
tudinal  relaxation  time  tl  (see  Ref.  8  for  an  illustration  of 
this  convergence). 

For  thermal  dielectric  fluctuations,  F{x,t)  in  Eq.  (21)  is  a 
random  process  with  a  vanishing  average  and  correlations 

g 

established  by  “bare”  solvent  fluctuations 

(F(x,t)F{x' ,t'))  -  G0{x-x')S(t-t'). 

d  rL 

(22) 

It  then  follows  from  Eqs.  (18),  (21),  and  (22)  that  different 
modes  n  and  in  exhibit  no  dynamic  correlations,  and  the  only 
relevant  (for  the  same  centroid  coordinate)  nonzero  correla¬ 
tion  is 

(Vn(i,t)  Vn{&'))  =  2 DS(t  - 1'),  D  =  kBT/rL, 

with  the  effective  diffusion  coefficient  D  independent  of  the 
mode  index  with  the  chosen  normalization. 

The  random  dynamics  described  by  Eq.  (19)  is  thus  the 
Ornstein-Uhlenbeck  process,42  which  leads,  at  times  t>  r„, 
to  the  Gaussian  distribution  of  an  with  the  variance  estab¬ 
lished  by 

(fl»)  =  Dt„, 

the  probability 


Mode  amplitudes 
(arb.  units) 


FIG.  5.  Modulation  of  the  two  lowest  electronic  energy  levels  E0  and  by 
the  normal  fluctuations  modes,  Eq.  (15),  as  a  function  of  mode  amplitudes  in 
arbitrary  units.  The  solid  lines  show  the  modulation  by  the  even  mode  n 
=  1  and  long-dash  lines  by  the  odd  mode  n= 2.  The  confinement  parameter 
used  for  this  illustration  is  aB/R=  10. 


P(a„)  K  exp1 


kBT  r 


(23) 


Equation  (23)  features  the  classic  Boltzmann  distribution  for 
independent  fluctuation  modes  with  the  effective  potential 
energies  Un(an).  The  behavior  of  U„  is  consistent  with  be¬ 
havior  (20)  of  relaxation  times. 

As  the  normal  mode  index  n  grows,  the  interaction  of  the 
solvent  polarization  with  the  electronic  density  in  the  ground 
state  generally  decreases,  likewise  the  modulation  effect  on 
the  transition  energy  E10  generally  diminishes.  The  largest 
modulation  effect  occurs  due  to  the  even  normal  fluctuation 
mode  n=  1 ,  corresponding  to  the  shallowest  potential  energy 
U | .  This  modulation  is  illustrated  in  Fig.  5  comparing  the 
effects  of  two  normal  modes  on  the  electronic  energies  E0 
and  E1.  It  is  evident  that  the  major  contribution  comes  from 
the  modulation  of  the  ground  state  level  E0  by  the  n  =  1 
mode.  As  discussed  in  Ref.  8,  the  spatial  pattern  of  this  mode 
corresponds  qualitatively  to  the  modulation  of  the  polaron 
length  /,  we  will  accordingly  call  this  the  breathing  mode. 

While  illuminating  the  character  of  the  fluctuations  and 
their  modulation  effects,  the  small-amplitude  description 
may  be  insufficient  for  realistic  T  on  the  order  of  room  tem¬ 
perature.  In  this  paper  we  do  not  pursue  a  comprehensive 
analysis  of  the  absorption  line  shape  that  would  then  be  re¬ 
quired.  Instead,  we  will  provide  a  qualitative  assessment  of 
the  effect  of  the  polaron  breathing  using  a  simplistic  single¬ 
parameter  scaling  description  similar  in  the  spirit  to  the  ap¬ 
proach  used  in  recent  studies  of  continuum  adiabatic  models 
of  3D  electron  solvation43  and  of  electron  localization  at 
dielectric-metal  interfaces.  In  this  description,  the  fluctua¬ 
tions  are  restricted  to  modulate  the  electrostatic  potential  via 
a  scaling  ansatz  of  the  equilibrium  pattern 
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FIG.  6.  Variations  of  the  electronic  energy  SE0,  polarization  energy  SEP, 
and  the  total  adiabatic  energy  SEq1  with  the  scaling  parameter  c.  The  con¬ 
finement  parameter  used  for  this  illustration  is  aB/R=  10. 


4>{x)  =  cc/)(cx),  (24) 

where  c&0  is  the  scaling  parameter.  Its  deviation  from  the 
value  of  c=  1  results  in  variations  of  all  quantities  relevant 
for  the  optical  absorption.  It  should  be  noted  that  the  results 
of  small  deviations  from  c  =  1  turn  out  to  be  quite  close  to  the 
results  we  derived  with  exact  small-amplitude  breathing  fluc¬ 
tuations. 

Figure  6  displays  variations  of  different  energies  SE{c ) 
=  F(c)-F(c  =  1)  with  the  scaling  parameter  c.  It  is  evident 
that  the  shallow  response  of  the  total  energy  SE["1  to  fluctua¬ 
tions  around  the  equilibrium  is  a  result  of  a  near  compensa¬ 
tion  of  much  larger  variations  of  the  polarization  energy  SEP 
and  the  purely  electronic  energy  SE0,  as  indeed  should  be  the 
case  for  the  ground  state  adiabatic  surface.  It  is  this  strong 
variability  of  E0  in  response  to  the  polaron  breathing  that  has 
already  been  noted  in  Fig.  5  and  is  largely  responsible  for  the 
resulting  enhanced  broadening  of  the  line  shape.  The 
“mechanism”  of  the  enhancement  is  also  illustrated  in  Fig.  7 
displaying  two  adiabatic  energy  surfaces,  E Q0t  and  F)01,  be- 


FIG.  7.  Total  adiabatic  energies  £“*  and  Ef  as  functions  of  the  scaling 
parameter  c  calculated  here  for  the  confinement  parameter  aB/R=  10.  The 
plot  illustrates  how  fluctuations  of  small  energy  Ar  translate  into  much 
larger  variations  A,,,  of  the  electronic  transition  energies. 


FIG.  8.  The  model  absorption  line  shapes  calculated  as  described  in  the  text 
for  aBIR= 4  and  aB/R=  10  and  for  two  different  temperatures:  kBT 
=0.2  Ry  (long-dash  curves)  and  kBT—  0.3  Ry  (solid  curves).  Vertical  short- 
dash  lines  show  the  positions  of  the  respective  electronic  transition  energies 
Ew  in  the  equilibrium  configurations.  The  low-frequency  tail  of /(&))  for  the 
case  of  aBIR=4  is  not  shown  completely  as  the  computational  scheme  used 
becomes  inadequate. 


tween  which  the  dominant  optical  transitions  take  place.  The 
behavior  of  F^ot  here  reflects  large  variations  SEP  of  the  po¬ 
larization  energy  without  a  comparable  compensation  of  the 
electronic  energy  that  occurs  for  F[J'1.  Figure  7  shows  how  a 
population  on  Fq°1  within  a  relatively  small  energetic  broad¬ 
ening  A7  translates  into  a  much  broader  distribution  A(il  of 
the  transition  energies. 

To  model  the  absorption  line  shape  as  a  function  of  fre¬ 
quency  co,  we  work  with  an  ensemble  average  over  various 
fluctuations  represented  by  different  values  of  the  scaling 
parameter  c  in  Eq.  (24).  The  resulting  oscillator  strength  den¬ 
sity  in  Eq.  (13)  is  then  a  Boltzmann  average  of  Eq.  (14) 
taking  the  form  of 

/(<*>)  =  I  dcP(c)^jfi0(c)S(co- w,o(c)),  (25) 

J  i 


where  the  probability 

PW=exp(-^r)/  /*“!>(- 


SE'0°Xc)\ 
kBT  ) 


determines  the  weighting  of  fluctuations  on  the  ground  state 
adiabatic  surface  Fg0t.  Both  transition  frequencies  oj,0(c)  and 
oscillator  strengths  /,0(c)  in  Eq.  (25)  are  functions  of  the 
scaling  parameter  c. 

Figure  8  shows  examples  of  the  absorption  line  shapes 
calculated  with  Eq.  (25)  for  two  values  of  the  confinement 
parameter  and  two  different  values  of  temperature  T.  The 
results  are  broad  peaks  around  the  respective  equilibrium 
transitions  frequencies  F10.  The  broadening  of  the  absorption 
lines,  as  we  discussed,  is  strongly  enhanced,  roughly  an  or¬ 
der  of  magnitude  larger  than  the  thermal  energies  in  our  ex¬ 
amples.  Note  that  both  photon  energy  hco  and  thermal  energy 
kBT  in  Fig.  8  are  measured  in  units  of  the  energy  scale  Ry, 
Eq.  (1).  If  we  were  to  use  lower  temperatures  than  shown  in 
the  figure,  say  kBT=0A  Ry,  the  broadening  would  decrease 
in  comparison  to  Ry  but  would  still  be  very  substantial  in 
comparison  with  kBT  itself.  Of  course,  the  essence  of  the 
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enhancement  mechanism  here  is  generic.  A  similar,  order-of- 
magnitude-enhanced,  broadening  have  been  reported  for  3D 
solvated  electrons.34  It  is  this  large  and  confinement- 
parameter-dependent  broadening  that  is  the  emphasis  of  Fig. 
8  rather  than  details  of  the  model  line  shapes. 


IV.  CONCLUSIONS 

As  a  result  of  solvation  by  the  surrounding  sluggish  po¬ 
lar  medium,  excess  charge  carriers  on  ID  SC  nanostructures 
can  undergo  self-localization  into  ID  polarons  whose  prop¬ 
erties  are  quite  different  from  band  carriers.  In  an  effort  to 
establish  experimentally  testable  signatures  of  the  polaron 
formation,  in  this  paper  we  examined  the  local  optical  ab¬ 
sorption  from  solvation-induced  ID  polarons  within  a  sim¬ 
plified  single-particle  adiabatic  framework  for  tubular  struc¬ 
tures. 

The  calculations  indicate  that  a  new  broad  absorption 
feature  is  expected,  the  peak  energy  of  which  is  comparable 
with  (but  larger  than)  the  binding  energy  of  the  polaron  and 
whose  width  is  much  larger  than  the  thermal  energy.  This 
feature  arises  mostly  due  to  the  electronic  transition  between 
two  lowest-energy  localized  electronic  levels  formed  in  the 
polarization  potential  well.  We  identified  the  polaron  length 
fluctuations  (breathing  mode)  as  the  major  source  of  the 
broadening. 

Similarly  to  the  enhanced  binding  of  ID  excitons,  the 
binding  of  ID  polarons  is  stronger  than  in  higher  dimen¬ 
sions,  and  we  explored  polaron  structure  and  optical  transi¬ 
tions  as  a  function  of  the  confinement  parameter  aBIR,  where 
aB  is  the  effective  3D  Bohr  radius  (1)  and  R  the  radius  of  the 
tubular  structure.  Of  particular  relevance  for  the  optical  ab¬ 
sorption  is  the  equilibrium  transition  energy  El0 ,  which  ap¬ 
proximately  determines  the  position  of  the  absorption  peak 
(Fig.  8).  Within  the  range  shown  in  Fig.  4,  the  scaling  of  this 
energy  may  be  approximated  as 

£10/Ry  « (aB/R)01.  (26) 

Note  that  the  relation  between  Ej0  and  the  binding  energy  Eh 
also  changes  with  the  confinement  parameter  (Fig.  3).  To¬ 
gether  with  the  parametric  dependence  (1)  of  the  Rydberg 
energy  Ry,  Eq.  (26)  allows  to  assess  how  the  position  of  the 
absorption  peak  scales  with  system  parameters  [compare  to 
scaling  of  excitons  in  SWCNTs  (Refs.  44  and  45).  It  follows 
then  that  the  effective  dielectric  constant  e*  of  the  solvent 
affects  it  as 

Ew  ^  (e")  1J- 

If  the  effective  mass  m  of  the  carrier  was  independent  of  the 
tube  radius  R ,  then  Eq.  (26)  would  result  in 

£j0  00  m03. 

In  SWCNTs,  however,  m  does  depend  on  tube  radius,  and, 
for  not  very  small  radius  tubes,  this  dependence  is  roughly 
/72  1  / R. '-1'44  in  this  regime,  the  confinement  parameter  be¬ 

comes  practically  independent  of  R,  and  the  transition  energy 
scales  as 


£10  m  R  l. 

For  the  specific  numerical  example  provided  in  Sec.  I,  the 
absorption  peak  could  then  be  expected  at  energy  close  to 
0.15  eV.  For  a  tube  half  as  wide,  under  the  same  dielectric 
conditions  the  peak  could  be  at  about  0.3  eV.  These  numeri¬ 
cal  examples  should,  however,  be  considered  only  as  repre¬ 
sentative,  as  they  are  subject  to  variations  of  the  effective 
system  parameters  and  model  details.  We  note  that  a  strong 
infrared  absorption  has  also  been  predicted  for  the  2D  elec- 
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trons  localized  at  the  metal-polar-dielectric  interface. 

To  reiterate,  illustrative  results  in  this  paper  have  been 
derived  using  simplified  model  considerations.  More  accu¬ 
rate  treatments,  particularly  of  temperature  effects,  may  be 
needed  for  detailed  comparisons  with  experimental  data 
when  it  becomes  available. 
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Using  long  C  VH2  conjugated  carbon  chains  with  the  polyynic  structure  as  prototypical  examples  of 
one-dimensional  (ID)  semiconductors,  we  discuss  self-localization  of  excess  charge  carriers  into  ID 
large  polarons  in  the  presence  of  the  interaction  with  a  surrounding  polar  solvent.  The  solvation 
mechanism  of  self-trapping  is  different  from  the  polaron  formation  due  to  coupling  with 
bond-length  modulations  of  the  underlying  atomic  lattice  well  known  in  conjugated  polymers. 
Model  ab  initio  computations  employing  the  hybrid  B3LYP  density  functional,  in  conjunction  with 
the  polarizable  continuum  model,  are  carried  out  demonstrating  the  formation  of  both  electron  and 
hole  polarons.  Polarons  can  emerge  entirely  due  to  solvation  but  even  larger  degrees  of  charge 
localization  occur  when  accompanied  by  atomic  displacements.  ©  2009  American  Institute  of 
Physics.  [DOI:  10.1063/1.3105337] 


I.  INTRODUCTION 

One-dimensional  (ID)  semiconductor  (SC)  structures 
such  as  7r-conjugated  polymers  (CPs),  nanotubes,  and  nano¬ 
wires  are  interesting  nanoscopic  objects  that  can  be  exploited 
in  various  areas  including  (opto)electronics,  energy  harvest¬ 
ing,  and  sensors.  The  nature  and  properties  of  excess  charge 
carriers  on  these  structures  are  fundamental  for  many  appli¬ 
cations.  As  covalent  bonds  provide  for  wide  electronic  bands 
and  effective  band  masses  much  smaller  than  the  free  elec¬ 
tron  mass,  intrinsically  excess  charge  carriers  in  these  sys¬ 
tems  have  a  high  propensity  for  delocalization  and  a  poten¬ 
tial  for  high  mobilities.  If  not  for  extrinsic  defects,  inherent 
limitations  on  the  mobility  arise  due  to  interactions  of  charge 
carriers  with  other  subsystems — most  commonly  with  dis¬ 
placements  of  the  underlying  atomic  lattice  of  the  ID  SC 
structure.  This  electron-phonon  coupling  has  been  a  subject 
of  numerous  studies  for  various  specific  ID  systems.  It  is 
well  known  that,  as  a  result  of  the  interaction  with  lattice 
phonons,  excess  charge  carriers  may  undergo  self¬ 
localization  into  polaronic  states.  Such  polarons  have  been 
extensively  discussed  in  the  context  of  CPs,  where  they  are 
believed  to  be  accompanied  by  localized  bond-length  modu¬ 
lation  patterns  and  new  features  in  the  optical  absorption 
(see,  e.g.,  Refs.  1  and  2  for  reviews  and  original  references). 

A  different  and  much  less  explored  implementation  of 
the  strong  polaronic  effect  can  take  place  when  a  ID  SC  is 
immersed  in  a  three-dimensional  (3D)  polar  medium,  the 
situation  is  of  particular  relevance  to  applications  and  pro¬ 
cesses  involving  fundamental  redox  reactions  in  polar  sol¬ 
vents.  In  this  case  of  excess  charge  carrier  solvation,  the 
long-range  Coulomb  interaction,  as  has  been  recently  dis- 
cussed  within  simplified  theoretical  models,  can  result  in 
the  formation  of  ID  adiabatic  large-radius  polarons,  where  a 
localized  electronic  state  on  a  SC  structure  is  surrounded  by 
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a  self-consistent  pattern  of  the  sluggish  (orientational)  polar¬ 
ization  of  the  solvent.  The  Coulomb  mechanism  of  the  po¬ 
laron  formation  here  is  analogous  to  the  well-known  3D  po- 
larons  in  polar  SCs  and  solvated  electrons  in  polar 
liquids.  ~  Distinct  from  those  cases  are  the  confinement  of 
the  electron  motion  in  a  ID  nanostructure  and  its  structural 
separation  from  the  3D  polarizable  medium.  The  energetic 
significance  of  the  solvation  has  been  emphasized  by 
finding4'3  that  the  binding  energy  of  the  resulting  ID  po¬ 
larons  could  reach  a  substantial  fraction,  roughly  one-third, 
of  the  binding  energy  of  Wannier-Mott  excitons,  the  well- 
known  primary  photoexcitations  in  many  ID  SCs.  This  may 
lead  to  enhanced  charge  separation.  On  the  other  hand,  the 
mobility  of  solvated  charge  carriers  is  drastically  reduced 
due  to  the  dissipative  drag  of  the  medium. 3,6,14  New  optical 
absorption  signatures  are  also  expected  as  a  result  of 
solvation.7'15 

Given  the  generic  character  of  the  solvation  mechanism 
of  charge  carrier  self-localization  and  its  potential  implica¬ 
tions  for  carrier  mobilities  and  redox  processes  on  ID  SC 
nanostructures,  this  effect  appears  interesting  and  important 
enough  to  warrant  studies  at  various  computational  levels.  As 
a  step  in  that  direction,  we  have  attempted  a  first — to  our 
knowledge — model  ab  initio  study  reported  in  a  recent 
communication13  that  would  demonstrate  the  formation  of 
solvation-induced  ID  large-radius  polarons.  In  this  paper  we 
elaborate  on  that  brief  report  discussing  features  of  solvation- 
induced  self-localization  of  excess  charges  on  finite  linear 
conjugated  carbon  chains  C^Hj  (polyyne  oligomers)  as  de¬ 
rived  within  density-functional  theory  (DFT)  computations 
in  conjunction  with  the  polarizable  continuum  model 
(PCM).16-18 

Ab  initio  calculations  are  expected  to  be  helpful  in  elu¬ 
cidating  the  role  played  by  valence  electrons  and  many- 
electron  interactions  in  accommodation  of  an  extra  charge 
carrier.  It  should  be  noted  that  a  reorganization  of  valence 
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band  electrons  can  be  important  even  in  noninteracting  elec¬ 
tron  models — a  nice  illustration  was  given  in  Ref.  19  that 
analyzed  how  polarons  of  a  two-band  Peierls  dielectric 
model  evolve  into  single-particle  Holstein”  polarons  in  the 
limit  of  the  “frozen  valence  band”  approximation.  With  real¬ 
istic  Coulomb  interactions  in  place,  the  role  of  valence  elec¬ 
trons  in  the  formation  of  the  relevant  self-consistent  poten¬ 
tials  would  only  increase.  Interestingly  enough,  despite  a 
relatively  long  history  of  first-principles  studies  of  electron- 
phonon  polarons  in  CPs  (see,  e.g.,  multiple  references  in 
Refs.  21  and  22),  certain  questions  have  been  raised  recently 
regarding  the  applicability  of  different  ab  initio  frameworks 
to  describe  the  formation  of  those  polarons.  That  pertains  to 
the  failure  of  the  local-density-approximation  and 
generalized-gradient-approximation  DFT  schemes  to  detect 
self-localized  charge  density  distributions  in  various  charged 
oligomers,  while  Hartree-Fock  (HF),  parametrized  semi- 
empirical  and  possibly  hybrid-functional  DFT  methods  have 
been  reported  to  lead  to  charge  localization  in  the  middle  of 
oligomers  (e.g.,  Refs.  22-25).  Quite  different  level-of- 
theory-dependent  magnitudes  of  the  effective  electron- 
phonon  coupling  have  also  been  found  even  in  neutral  poly¬ 
meric  systems  (see,  e.g.,  a  comparative  discussion  in  Refs. 
26  and  27).  A  caution  therefore  has  to  be  exercised  in  the 
interpretation  of  at  least  the  quantitative  aspects  of  various 
ab  initio  results. 

As  first-principles  calculations  are  applied  to  the  effect 
of  solvation  on  charge  carrier  self-localization  within  the 
PCM  framework,  it  would  also  be  desirable  to  have  a  com¬ 
prehensive  comparison  of  different  computational  ap¬ 
proaches.  Such  a  study  is  currently  underway  and  we  are 
planning  to  report  its  results  elsewhere.  Reassuringly,  the  ef¬ 
fect  of  solvation  on  excess  charge  localization  appears  quali¬ 
tatively  robust  in  those  computations.  While  providing  an 
illustrative  example  of  comparison  with  HF  computations,  in 
this  paper  we  rather  focus  on  a  more  modest  goal  to  discuss 
trends  that  are  derived  within  a  single  computational  scheme. 
Specifically,  we  employ  DFT  calculations  with  the  B3LYP 
hybrid  exchange-correlation  density  functional.  This  func¬ 
tional  is  known  to  include  both  local  and  nonlocal  effects  and 
has  been  commonly  used  in  recent  studies  ’  ’  of 
polyyne  and  its  oligomers. 

By  exploring  model  systems  with  prescribed  carbon 
atom  positions,  we  will  be  able  to  illustrate  excess  charge 
carrier  (an  electron  or  a  hole)  localization  and  the  resulting 
formation  of  large  ID  polarons  entirely  due  to  the  polariza¬ 
tion  of  the  surrounding  solvent.  Examples  of  self- 
consistent — that  is,  practically  independent  of  the  chain 
length — polaron  structures  will  thereby  be  also  given.  Within 
the  confines  of  the  computational  scheme  used,  “realistic” 
geometries  of  carbon  atom  arrangements  are  those  that  opti¬ 
mize  the  total  energy  of  the  system.  We  have  performed  such 
geometrical  optimizations  for  polyynic  chains  both  in 
vacuum  and  in  solvent.  In  the  range  of  chain  lengths  studied 
(up  to  N=  100  of  carbon  atoms)  with  the  B3LYP  hybrid  func¬ 
tional,  we  have  not  been  able  to  detect  a  clear  trend  toward 
excess  charge  self-localization  due  to  the  electron-phonon 
interaction  alone — as  the  chain  length  increased,  the  bond- 
length  modulation  pattern  would  only  become  flatter  in  the 


middle  of  the  chain.  On  the  contrary,  in  the  presence  of  the 
Coulomb  interaction  with  the  solvent,  clearly  localized  pat¬ 
terns  of  bond-length  alternation  (BLA)  have  been  found  to 
act  as  to  boost  the  degree  of  excess  charge  localization,  pos¬ 
sibly  reflecting  synergistic  effects  from  both  mechanisms  in 
this  case. 

II.  SYSTEMS,  COMPUTATIONS,  AND  DATA  ANALYSIS 

Since  the  computational  demand  increases  substantially 
in  DFT-PCM  calculations,  we  have  selected  structurally 
simple  ID  semiconducting  systems  of  hydrogen- terminated 
polyynic  linear  carbon  chain  CVH2  for  our  demonstration.  It 
should  be  stressed  however  that,  while  playing  the  role  of  a 
prototypical  example  in  our  study,  polyynic  chains  continue 
to  be  the  subject  of  much  attention  in  their  own  right  (see 
Refs.  26,  27,  31,  and  32  and  multiple  references  therein). 
Interestingly,  they  were  predicted  ’  to  possess  a  rich  family 
of  electron-lattice  self-localized  excitations  because  of  the 
extra  degeneracy  of  molecular  orbital  (MO)  levels. 

The  subject  of  our  attention  in  this  paper  is  the  long 
chain  CVH2  with  even  number  of  carbons  N  ( N  between  20 
and  100).  It  is  well  established  now26’27’29'35’36  that  even-/V 
long  (AS  10)  neutral  chains  have  their  ground  state  with  the 
polyynic  structure,  that  is,  they  feature  a  (nearly  uniform) 
alternating  pattern  of  triple  C  =  C  and  single  C-C  bonds 
leading  to  a  gap  in  the  electronic  spectrum.  The  ground  state 
of  odd-N  chains  would  exhibit  a  kinklike  “defect”  in  the 
bond-alternation  pattern  with  associated  mid-gap  states,  ' 
which  we  do  not  discuss  here.  Having  neutral  chain  QvH°  as 
a  benchmark,  we  have  studied  both  negatively  charged  CivHj 
(an  extra  electron)  and  positively  charged  CVH  j  (an  extra 
hole)  chains.  As  illustrated  below,  their  self-localization  be¬ 
havior  has  been  found  very  similar,  with  the  excess  charge 
measured  correspondingly  in  units  of  (— e)  or  (e),  where  e  is 
the  magnitude  of  the  fundamental  charge.  To  save  space, 
some  of  the  results,  therefore,  would  be  shown  only  for  the 
negatively  charged  systems. 

For  the  ab  initio  engine  in  this  study,  we  employ  DFT 
within  the  GAUSSIAN  03  (Ref.  16)  set  of  programs.  The  ma¬ 
jority  of  the  results  presented  in  this  paper  have  been  derived 
with  the  B3LYP  hybrid  exchange-correlation  functional 
which  includes  the  HF  exchange  along  with  DFT 
correlation.”  In  order  to  investigate  the  effects  of  solvation 
in  a  polar  medium,  GAUSSIAN  03  (Ref.  16)  offers  its  imple¬ 
mentation  of  the  PCM  described  in  original  publications. 37-40 
A  rich  6-31 1  +  +G(d,p)  all  electron  basis  set  has  been  em¬ 
ployed  throughout.  It  should  be  noted  that  the  results  of  our 
“in  vacuum”  calculations  have  been  verified  to  compare  well 

27 

against  the  previously  published  ab  initio  data  on  neutral" 
and  charged"  polyynic  systems  both  in  terms  of  energetics 
and  optimized  bond  lengths.  For  all  DFT-PCM  (“in  solvent”) 
calculations,  water  has  been  chosen  as  a  polar  solvent  with 
its  default  parameters  in  GAUSSIAN  03.  We  remind  the  reader 
that  the  focus  of  this  paper  is  to  compare  results  obtained  in 
vacuum  and  in  solvent  environments  within  the  same  com¬ 
putational  scheme  rather  than  compare  various  computa¬ 
tional  approaches. 

In  order  to  be  able  to  clearly  discern  the  effects  due  to 
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FIG.  1.  Spatial  charge  distributions  on  N=  100  polyynic  chains  with  RG1  geometry  derived  by  using  atomic  charges  from  Mulliken  (panel  columns  indexed 
1  and  3)  and  Lowdin  (panel  columns  2  and  4)  procedures.  Panel  row  (a)  is  for  the  totally  neutral  chains,  row  (b)  for  the  negatively  charged  chains  with  one 
added  electron,  and  row  (c)  depicts  the  distribution  of  excess  charge  defined  as  a  difference  of  the  data  in  rows  (b)  and  (a).  Two  left  columns  (1  and  2)  depict 
charges  per  atom,  two  right  columns  (3  and  4)  yield  charges  per  unit  cell.  In  each  of  the  panels,  solid  lines  show  the  results  for  chains  in  solvent,  whereas 
dashed  lines  (when  distinguishable)  for  chains  in  vacuum.  It  is  evident  that  cell-centric  excess  charge  distributions  in  panels  (c3)  and  (c4)  are  nearly  identical 
despite  substantial  differences  between  Mulliken  and  Lowdin  charges  observed  in  other  panel  pairs. 


the  solvation  and  provide  conceptual  proofs  and  illustrations 
(see  Sec.  Ill),  we  will  be  exploring  both  systems  with  the  full 
geometric  optimization  of  the  underlying  atomic  lattice  and 
systems  with  model  rigid  geometries,  that  is,  with  prescribed 
atomic  positions.  One  of  such  rigid  geometries,  referred  to  as 
RG1,  features  the  following  fixed  lengths:  1.235  A  for  the 
triple  bonds,  1.326  A  for  the  single,  and  1.063  A  for  the  C-H 
bonds.  These  lengths  have  been  retrieved  from  our  complete 
optimization  of  the  neutral  N=  80  chain  in  vacuum  and  are  in 
fact  relatively  close  to  optimal  values  (see  Sec.  IV).  Other 
rigid  geometries  have  also  been  explored  where  all  single 
C-C  bonds  are  artificially  stretched  in  successive  increments 
of  0.1  A  up  to  the  final  length  of  1.726  A,  the  latter  geometry 
is  denoted  as  RG2.  This  artificial  stretching  is  used  to  study 
the  effect  of  narrower  electronic  bandwidths  (larger  band  ef¬ 
fective  masses)  on  the  charge  localization. 

A  very  relevant  quantity  for  our  discussion  is  the  equi¬ 
librium  spatial  charge  distribution  over  the  ID  SC,  which  is 
naturally  related  to  atomic  charges  in  outputs  of  ab  initio 
calculations.  Importantly,  these  charges  are  calculated  from 
many-electron  wave  functions  and  thereby  reflect  responses 
due  to  all  electrons  in  the  system.  Two  procedures,  due  to 
Mulliken  and  Lowdin,  are  widely  used  for  charge  population 
analysis.  It  is  well  known  that  calculations  of  atomic 
charges  depend  on  the  basis  set  transformations  and  some¬ 
times  lead  to  artifacts  and  spurious  results.  Such  artifacts  are 
in  fact  apparent  in  our  illustration  in  Fig.  1  that  compares 
various  results  we  obtained  for  /V=  1 00  chains.  Differences 
between  Mulliken  and  Lowdin  charges  are  especially  clearly 
seen  for  raw  charge  populations  separately  on  neutral  [row 
(a)]  and  charged  [row  (b)]  chains,  whether  they  are  atomic  or 


unit-cell  centric.  Comparison  of  panels  (cl)  and  (c2),  how¬ 
ever,  indicates  that  certain  cancellation  effects  take  place  and 
both  methods  lead  to  less  dissimilar  results  when  one  is  in¬ 
terested  in  the  spatial  distribution  of  the  excess  charge  de¬ 
rived  as  the  difference  in  charge  densities  on  charged  and 
neutral  chains.  Still  those  atomic-centric  results  exhibit  pos¬ 
sibly  spurious  oscillations  which  may  be  artifacts  related  to 
the  presence  of  bond-oriented  charge  density  waves  in  our 
system.  This  suggests  “averaging”  of  the  charge  over  the  unit 
cells  consisting  of  pairs  of  neighboring  carbons  (for  conve¬ 
nience,  the  chain  end  cells  are  defined  here  to  include  the 
hydrogen  atomic  charges).  Panels  (c3)  and  (c4)  show  that 
various  artifacts  indeed  practically  disappear,  and  the  unit- 
cell-based  excess  charge  distributions  calculated  by  Mulliken 
and  Lowdin  procedures  become  remarkably  close.  We  con¬ 
firmed  this  conclusion  for  many  other  cases  of  interest.  Ac¬ 
cordingly,  it  is  these  stable  results  for  excess  charge  per  cell 
that  will  be  displayed  in  our  results  below  (using  raw  Low¬ 
din  charges). 


III.  SOLVATION-INDUCED  SELF-CONSISTENT 
CHARGE  LOCALIZATION  IN  MODEL  SYSTEMS 

In  real  systems,  both  atomic  displacements  and  polariza¬ 
tion  of  the  surrounding  medium  are  expected  to  take  place  in 
order  to  accommodate  an  excess  charge  carrier.  Moreover, 
given  the  nonlinear  nature  of  the  polaronic  effect,  both  sub¬ 
systems  may  act  in  a  synergistic  way.  In  order  to  better  un¬ 
derstand  the  contribution  coming  from  the  solvation,  we 
therefore  start  from  studying  model  systems  with  rigid  ge- 
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FIG.  2.  Excess  charge  density  spatial  distribution  on  negatively  charged 
chains  with  N=20,  40,  60,  80,  and  100 — top  to  bottom  curves,  re¬ 
spectively,  in  their  central  parts.  All  chains  feature  the  same  pattern  of  fixed 
bond  lengths  corresponding  to  RG1  geometry  described  in  text.  Panel  (a) 
displays  results  for  chains  in  vacuum,  panel  (b)  for  chains  in  the  solvent 
environment. 

ometries,  where  no  localized  bond-alternation  patterns  are 
allowed,  while  the  medium  “adjusts”  its  state  of  polarization. 


FIG.  4.  Excess  charge  density  spatial  distribution  on  charged  chains, 
(a)  N=  80.  The  evolution  of  the  charge  distribution  upon  stretching  the  single 
bonds  by  0.1  A  successive  increments  from  1.326  to  1.726  A — bottom  to 
top  in  a  group  of  5  clearly  distinct  curves  shown  by  different  connected 
symbols  and  denoted  in  solvent.  The  same — but  unconnected — data  sym¬ 
bols  are  used  to  show  the  results  for  chains  in  vacuum,  those  distributions 
largely  overlap,  (b)  Comparison  of  distributions  for  chains  in  the  solvent 
with  N=  40,  60,  80  (different  data  symbols),  and  100  (solid  line)  for  the 
geometry  RG2  featuring  the  longest  single  bonds.  Curves  for  N=60,  80,  and 
100  practically  coincide. 


A.  Spatial  distribution  of  the  excess  charge 

Figures  2  and  3  compare  excess  charge  density  distribu¬ 
tions,  negative  and  positive,  respectively,  calculated  as  de¬ 
scribed  in  Sec.  II  for  chains  of  varying  lengths  both  in 
vacuum  and  in  solvent.  It  is  evident  that  panels  (a)  and  (b)  of 
each  of  the  figures  display  quite  distinct  trends.  Disregarding 
the  end  effects  (end  cells  include  charges  on  hydrogens), 
vacuum  results  in  panel  (a)  reflect  the  “particle  in  a  box” 
behavior:  as  the  chain  length  increases,  the  excess  charge  is 
distributed  more  and  more  uniformly  over  the  whole  chain. 
In  a  sharp  contrast,  for  chains  in  the  solvent,  panel  (b),  the 
excess  charge  exhibits  much  more  localized  distributions 
around  the  central  parts  of  the  chains  to  the  extent  that  no 
appreciable  end  effects  are  present.  As  the  chain  length  in¬ 
creases,  those  distributions  show  a  very  clear  tendency  to¬ 
ward  convergence,  albeit  not  completely  achieved  within  the 
range  of  lengths  studied. 

Instead  of  further  increasing  the  chain  length  in  already 
demanding  computations,  we  now  attempt  to  decrease  the 


FIG.  3.  As  in  Fig.  2  but  for  positively  charged  CVH j  chains  with  N=  20,  40, 
60,  and  80 — top  to  bottom  curves,  respectively,  in  their  central  parts. 


spatial  extent  of  charge  localization  in  model  systems.  To 
this  end,  we  recall  that,  in  single-particle  models,  the  spa¬ 
tial  extent  of  the  polaron  is  determined  by  the  balance  of  the 
gain  in  the  potential  energy  of  the  self-localized  carrier  due 
to  its  interaction  with  the  medium  and  the  loss  in  its  kinetic 
energy  due  to  localization.  Decreasing  the  role  of  the  kinetic 
energy  would  lead  to  a  diminished  delocalization  propensity 
and  is  expected  to  shorten  the  polaron.  This  should  be 
achievable  with  an  increased  effective  band  mass  of  the  car¬ 
rier,  or  with  a  narrower  electron  bandwidth.  To  affect  this,  we 
chose  to  artificially  stretch  all  single  bonds  in  our  rigid 
model  system  as  described  in  Sec.  II  thereby  making  C  =  C 
“dimers”  more  and  more  separated  from  each  other.  Figure 
4(a)  (extra  electron)  and  Fig.  5(a)  (extra  hole)  illustrate  a 
dramatic  difference  in  the  response  of  the  excess  charge  dis¬ 
tributions  on  N=  80  chains  to  successive  single-bond  stretch- 


FIG.  5.  Excess  charge  density  spatial  distributions  on  charged  C^H,  chains, 
(a)  /V=80.  The  evolution  of  the  charge  density  distribution  upon  stretching 
bonds  as  in  Fig.  4.  (b)  Comparison  of  distributions  for  chains  in  the  solvent, 
with  /V=40,  60,  and  80  (different  data  symbols)  for  the  RG2  geometry. 
Curves  for  N=  60  and  80  practically  coincide. 
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ing  in  vacuum  and  in  solvent.  While  the  charge  density  in 
vacuum  shows  only  very  little  changes  in  this  process  and 
preserves  nearly  uniform  distributions  over  the  whole  chain, 
for  chains  in  solvent,  each  successive  stretching  step  indeed 
results  in  substantially  smaller  and  smaller  localization  re¬ 
gions.  As  the  extent  of  localization  in  the  longest-stretched 
geometry,  RG2,  appears  to  be  well  within  the  chain  length, 
we  can  now  compare  the  corresponding  excess  charge  distri¬ 
butions  for  several  chain  lengths.  Figure  4(b)  makes  it  evi¬ 
dent  that  the  distributions  for  N=  60,  80,  and  100  negatively 
charged  chains  in  solvent  practically  coincide.  Likewise,  dis¬ 
tributions  for  N=  60  and  80  positively  charged  chains  are 
practically  identical  in  Fig.  5(b).  This  confirms  that  the  ob¬ 
tained  charge  distributions  on  longer  chains  correspond  to  a 
fully  converged  self-consistent  pattern  of  excess  charge  lo¬ 
calization.  Importantly,  self-localization  of  the  excess  charge 
demonstrated  here  has  been  achieved  entirely  due  to  the  in¬ 
teraction  with  the  surrounding  solvent. 

Comparing  the  results  for  negatively  charged  and  posi¬ 
tively  charged  species  (Fig.  2  versus  Fig.  3  and  Fig.  4  versus 
Fig.  5,  one,  of  course,  notices  some  effects  of  charge- 
conjugation  (CC)  symmetry  breaking.  They  depend  on  the 
chain  length  as  affected  by  both  the  ends  (hydrogen  atoms) 
and  by  the  inherent  band  structure.  As  the  inherent  7r-orbital 
bands  are  nearly  CC  symmetric,  these  effects  are  evidently 
relatively  weak,  and  basically  the  localization  behavior  for 
electrons  and  holes  appears  very  similar.  This  fact  should  be 
emphasized  in  the  context  of  the  different  status  of  highest 
occupied  and  lowest  unoccupied  states  within  DFT  compu¬ 
tations,  and  observed  differences  in  electron  and  hole  po- 

larons  due  to  the  interaction  with  lattice  distortions  in  CPs 

2? 

with  more  complex  structures. 


B.  Molecular  orbitals 

We  now  turn  from  the  total  excess  charge  distribution  to 
individual  MOs  and  the  resulting  electronic  structure.  In  the 
neutral  C^FL  chains,  77-electron  energy  levels  exhibit  a  four¬ 
fold  degeneracy — in  addition  to  the  spin  degeneracy,  there  is 
a  twofold  degeneracy  with  respect  to  the  spatial  orientation 
of  77-orbitals.  If  the  chain  direction  is  chosen  as  the  7-axis, 
77-orbitals  can  be  said  to  be  oriented  either  along  the  x  or 
along  the  y  axis.  In  charged  chains,  spin-  and  orbital- 
orientation-degeneracy  are  generally  lifted.  Also,  in  the  pres¬ 
ence  of  the  polaronic  effect,  one  observes  both  spatially  lo¬ 
calized  and  delocalized  (over  the  chain  length)  individual 
electronic  states.  The  overall  picture  of  MO  energy  levels  in 
neutral  and  charged  chains  is  illustrated  in  Fig.  6  (CjooH® 
versus  C^qHj)  and  Fig.  7  (CggH®  versus  C80Ht),  while  Fig. 
8  illustrates  the  spatial  behavior  of  MOs  on  charged  C  i00FL 
chains  in  more  detail. 

Looking  at  the  electronic  structures  of  neutral  and 
charged  chains  in  Figs.  6  and  7,  one  can  compare  the  differ¬ 
ences  in  responses  between  chains  in  vacuum  and  chains  in 
solvent  as  well  as  compare  the  differences  caused  by  the 
geometry  of  carbon  atom  arrangements  (RG1  versus  RG2). 
Going  to  the  artificial  RG2  geometry  with  longer  C-C  bonds 
allows  us  here  to  decrease  the  polaron  size  and  to  illustrate 
some  features  that  would  otherwise  be  hard  to  observe  within 


vacuum  solvent  vacuum  solvent 


FIG.  6.  MO  energy  levels  for  neutral  (0)  and  charged  (—1)  C10oH2  chains 
with  rigid  RG1  [(a)-(d)]  and  RG2  [(e) — (h)]  geometries  both  in  vacuum  and 
in  the  solvent.  The  arrows  indicate  positions  of  HOMO  levels. 


the  range  of  chain  lengths  studied.  Probably  the  first  visually 
noticeable  difference  between  RG1  and  RG2  electronic 
structures  is  a  much  wider  band  gap  in  RG2 — as  indeed  one 
should  expect  based  on  a  much  larger  difference  in  the  hop¬ 
ping  integrals  corresponding  to  triple  and  single  bonds.  As 
such,  however,  the  size  of  the  band  gap  is  inconsequential  for 
the  degree  of  charge  localization  due  to  solvation.  Instead,  of 
direct  importance  here  is  a  much  narrower  bandwidth  in 
RG2,  which  is  easily  detected  via  a  higher  density  of  states 
[panels  (e)  versus  (a)  and  (g)  versus  (c)].  The  narrower  the 
bandwidth  is,  the  smaller  is  the  propensity  of  the  excess  car¬ 
rier  to  delocalize. 

A  common  feature  evident  in  the  results  for  charged 
chains  in  vacuum  is  the  appreciable  overall  energy  shifts 
with  respect  to  neutral  chain  MO  levels:  to  higher  energies 
for  excess  electrons  and  to  lower  energies  for  excess  holes. 
These  shifts  reflect  simple  electrostatic  effects  (the  electro¬ 
static  potential  with  respect  to  infinity)  for  chains  of  finite 
lengths.  Understandably,  shifts  of  such  an  origin  should  be¬ 
come  much  smaller  for  chains  surrounded  by  media  with 
high  dielectric  constants,  which  is  indeed  the  case  for  our 
chains  in  the  solvent  environment  (see  also  below  in  Sec. 
Ill  C) .  A  much  more  important  and  relevant  difference  in  the 
MO  structure  of  charged  chains  in  vacuum  and  in  solvent  is 
instead  that  solvated  chains  feature  local  electronic  levels 
clearly  separated  from  other  states.  In  the  case  of  negatively 
charged  chains  (Fig.  6),  it  is  the  highest  occupied  MO 


vacuum  solvent  vacuum  solvent 


FIG.  7.  MO  energy  levels  for  neutral  (0)  and  charged  (+1)  C80H2  chains 
with  rigid  RG1  [(a)-(d)]  and  RG2  [(e)-(h)]  geometries  both  in  vacuum  and 
in  the  solvent.  The  arrows  indicate  positions  of  LUMO  levels. 
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FIG.  8.  (Color  online)  Spatial  distribution  of  several  MOs  for  C10oHJ  chains  with  RG2  geometry  in  vacuum  (left  column)  and  in  solvent  (right  column).  The 
spin  state  of  orbitals  is  indicated  by  arrows,  while  the  spatial  orientation  by  letters  “x”  and  “y”  (chains  are  oriented  along  the  z-axis).  Orbital  energies  in  eV 
are  shown  in  the  right  of  the  columns.  Compared  MOs  around  HOMOs  are  sequentially  selected  from  a  list  ordered  in  orbital  energies. 


(HOMO)  accommodating  an  excess  electron.  In  the  case  of 
positively  charged  chains  (Fig.  7),  it  is  the  lowest  unoccupied 
MO  (LUMO)  accommodating  an  excess  hole.  Comparison 
of  panels  (d)-(b)  and,  especially,  panels  (h)-(f)  in  these  fig¬ 
ures  clearly  demonstrates  the  formation  of  local  levels — as 
would  be  caused  by  the  appropriate  polarization  of  the  envi¬ 
ronment.  The  appearance  of  localized  electronic  states  in 
self-consistent  “potential  wells”  is  one  of  the  hallmarks  of 
the  strong  polaronic  effect.911 

The  number  of  localized  electronic  states  appearing  due 
to  the  polaronic  effect  should  depend  on  the  nature  of  the 
system  and  the  effective  potential  well.  For  electron-lattice 
polarons  in  CPs,  local  levels  are  known  ’  to  split  off  from 
both  conduction  and  valence  bands.  The  short-range  charac¬ 
ter  of  the  effective  potential  well  in  that  case,  however,  limits 
the  number  of  such  states  (in  traditional  two-band  Peierls 
dielectric  models,1  there  would  be  only  two  local  levels:  one 
close  to  the  conduction  and  one  to  the  valence  band).  In  the 
case  of  the  long-range  interaction  with  the  polarizable  envi¬ 
ronment,  the  effective  potential  well  has  the  long-range  Cou¬ 
lomb  behavior  expected  to  result,  for  an  infinite  system,  in  a 
multitude  of  localized  states  of  various  spatial  symmetries 
converging  to  the  onset  of  the  delocalized  continuum.  This  is 
a  standard  picture  for  traditional  “single-band”  3D  polarons,9 
and  we  have  specifically  discussed  its  realization  for  ID  sys¬ 
tems  immersed  in  the  3D  polar  medium.4  7 

Restricting  a  more  detailed  illustration  now  to  the  nega¬ 
tively  charged  C100H2  chains,  Fig.  8  compares  the  spatial 
structure  of  HOMOs  and  several  other  MOs  (from  a  list  or¬ 
dered  in  orbital  energies)  in  vacuum  and  in  the  solvent  envi¬ 
ronment  as  retrieved  from  the  GAUSSVIEW  4  (Ref.  41)  rendi¬ 
tion  of  GAUSSIAN  03  outputs.  Here  we  use  results  only  for 
RG2  geometry  displaying  better  distinguishable  localized 
states.  (The  figure  of  course  also  illustrates  the  lifting  of  spin 
and  spatial-orientation  degeneracies.)  As  expected,  a  stark 
difference  is  seen  between  the  HOMOs  in  vacuum  and  sol¬ 
vent  cases:  the  former  shows  a  state  delocalized  over  the 
chain  length,  while  the  latter  indeed  exhibits  a  well-localized 
character.  In  addition  to  this,  however,  we  also  find  in  the 
right  column  of  Fig.  8  that  the  solvation-induced  negative 
polaron  features  various  occupied  localized  states  in  the  va¬ 
lence  band  region  of  energies  (nothing  like  this  happens  for 
valence  band  states  in  the  vacuum  case).  This  observation 
confirms  a  reorganization  of  valence  band  states  apparent  in 


the  corresponding  electronic  structure  in  Fig.  6(h).  Thus  the 

polaron  reveals  a  many-electron  structure,  more  complex 

3—7 

than  invoked  in  simplified  single-particle  models. 

Even  in  RG2  geometry  it  is  difficult  though  to  visually 
identify  the  LUMO  in  the  right  column  of  Fig.  8  as  an  un¬ 
occupied  localized  state  expected  to  be  formed  in  the  effec¬ 
tive  polarization  potential  well.4'7  While  that  state  wave  func¬ 
tion  evidently  exhibits  the  right  odd  symmetry  with  respect 
to  the  inversion  of  the  chain  axis,  the  length  of  the  chain 
appears  insufficient  to  establish  a  localized  character  of  the 
state  with  certainty.  Likewise,  within  the  length  of  our  oligo¬ 
mer,  we  cannot  establish  the  onset  of  what  would  become  a 
continuum  of  truly  delocalized  states  in  the  limit  of  infinite 
chains.  Similar  considerations  of  details  of  the  spatial  behav¬ 
ior  of  MOs  can  be  done  for  positively  charged  chains  which 
would  then  correlate  with  signatures  of  the  electronic  struc¬ 
ture  in  Lig.  7. 

C.  Energetics 

An  important  energetic  quantity  in  discussions  of  the 
polaronic  effect  is  the  polaron  binding  energy  referring  es¬ 
sentially  to  the  gain  in  the  total  energy  of  the  system  taking 
place  upon  self-localization  of  the  excess  charge  carrier.1911 
Calculations  of  the  binding  energy  of  solvation-induced  po¬ 
larons  on  ID  structures  are  very  straightforward  in  simplistic 
models.'  Unfortunately,  we  do  not  feel  we  have  a  reliable 
straightforward  procedure  for  the  determination  of  the  bind¬ 
ing  energy  of  solvation-induced  polarons  in  our  current  ab 
initio  calculations.  One  of  the  reasons  for  this  is  that  we 
cannot  control  the  state  of  the  polarization  of  the  solvent  as  it 
would  be  easily  done  by  prescribing  desired  patterns  of 
atomic  displacements  in  the  case  of  electron-lattice  polarons. 
Some  considerations  can,  however,  be  put  forward  based  on 
direct  outputs  of  GAUSSIAN  03  computations. 

Ligure  9  illustrates  such  considerations  for  excess  elec¬ 
trons.  One  very  well-defined  and  physically  meaningful 
quantity  characterizing  accommodation  of  an  excess  electron 
in  the  computational  outputs  is  the  electron  affinity 

FA  =  F°  -  F~ 

Mot  Mot’ 

corresponding  to  the  difference  in  the  total  system  energies 
for  neutral  (superscript  “0”)  and  negatively  charged  (“— ”) 
chains.  Another  readily  available  energy  is  the  energy  Lllmci 
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FIG.  9.  The  negative  of  electron  affinities,  —  EA,  (square  symbols)  and 
energies  £^UM0  of  neutral  chains  (triangles)  as  functions  of  the  number  of 
carbons  in  polyyne  oligomers  CvhF  with  rigid  geometries.  Vacuum  data 
points  are  connected  by  dashed  lines,  data  for  chains  in  solvent  by  solid 
lines.  Panel  (a)  displays  results  for  chains  with  RG1  geometry,  panel  (b) 
corresponds  to  chains  with  RG2  geometry. 


FIG.  10.  IPs  (square  symbols)  and  the  negative  of  HOMO  energies 
(— Ckomo)  of  neutral  chains  (triangles)  as  functions  of  the  number  of  carbons 
in  CnH2  with  rigid  geometries.  Vacuum  data  points  are  connected  by  dashed 
lines,  data  for  chains  in  solvent  by  solid  lines.  Panel  (a)  displays  results  for 
chains  with  RG1  geometry,  panel  (b)  corresponds  to  chains  with  RG2 
geometry. 


of  the  LUMO  in  the  neutral  system.  These  energies  are  com¬ 
pared  in  Fig.  9  as  — EA  versus  £lUM0  for  various  conditions 
as  functions  of  the  number  N  of  carbons  in  CVH2.  The  ratio¬ 
nale  for  this  comparison  is  that  in  the  limit  of  infinite  chains, 
N— s- °o,  one  might  hope,  in  the  spirit  of  Koopman’s 
theorem,^'4"  that  £^UM0  would  represent  the  extra  system 
energy  upon  an  addition  of  a  single  electron  into  the  lowest- 
energy  delocalized  state  with  vanishing  excess  charge  den¬ 
sity.  The  negative  of  the  electron  affinity,  —  EA,  on  the  other 
hand,  represents  the  extra  system  energy  upon  the  addition  of 
a  single  electron  when  the  system  is  allowed  to  completely 
reorganize  itself  to  achieve  the  minimum  possible  total  en¬ 
ergy.  Both  compared  “extra  energies”  are  of  course  negative 
in  our  data.  We  remind  the  reader  that  in  this  section  we 
discuss  systems  with  rigidly  positioned  carbons  so  that  reor¬ 
ganization  here  is  limited  only  to  the  states  of  other  electrons 
and  the  polarization  of  the  environment. 

A  striking  difference  is  observed  in  Fig.  9  when  results 
for  chains  in  vacuum  are  compared  to  chains  in  solvent.  In 
the  former  case,  the  data  correspond  to  -EA  >  ElumO’  while 
in  the  latter,  the  order  of  these  energies  is  reversed:  -EA 
<  f-ruviO’  clearly  showing  the  significance  of  screening  and 
reorganization  effects  taking  place  in  the  solvent  environ¬ 
ment.  Vacuum  data  points  are  evidently  very  far  from  dis¬ 
playing  a  convergence  to  the  infinite  chain  limit,  and  a  larger 
steepness  of  the  vacuum  EA  as  a  function  of  N  is  indicative 
of  the  magnitude  of  the  Coulomb  effects  involved  for  finite 
excess  charge  densities  in  oligomers.  While  data  points  for 
chains  in  solvent,  of  course,  also  exhibit  an  /V-dependence, 
this  dependence  is  shallower  than  in  the  vacuum  case,  and 
for  RG2  geometry  [panel  (b)]  displays  a  nearly  converged 
behavior  for  largest  N  studied.  As  we  discussed  in  Sec.  Ill  A, 
a  self-consistent  polaron  formation  is  achieved  in  RG2 
within  our  range  of  chain  lengths. 

Considering  the  positive  energetic  “improvement” 
£^UMo+EA  in  the  solvent  environment,  its  N=  1 00  magni¬ 
tudes  in  Fig.  9  are  equal  to  approximately  0.1  and  0.11  eV 
for  RG1  and  RG2  geometries,  respectively.  If  one  were  to 
take  this  energetic  improvement  in  the  i  nfi  nite-/V  limit  as 


corresponding  to  the  “true”  solvation-induced  polaron  bind¬ 
ing  energy,  then  the  obtained  numbers  —0.1  eV  would  pro¬ 
vide  estimates  of  the  polaron  binding. 

An  analogous  consideration  for  excess  holes  involves  a 
comparison  of  the  ionization  potential  (IP) 

IP  =  F+  -  F° 

lr  ^tot  'Aot 

and  the  negative  -E^omo  °f  the  energy  of  the  FIOMO  in  the 
neutral  chain.  This  comparison  of  now  positive  extra  system 
energies  is  illustrated  in  Fig.  10.  The  figure  displays  qualita¬ 
tive  patterns  similar  to  observed  above  for  excess  electrons 
(while,  of  course,  also  showing  signs  of  CCS  breaking).  Par¬ 
ticularly,  one  notes  while  for  chains  in  vacuum  one  has  IP 
>— E-homo  the  relationship  reverses  for  chains  in  solvent: 
IP  <  -/-’homo-  The  N=  80  values  of  the  energetic  improve¬ 
ment  -£homo_IP  in  the  solvent  environment  are  approxi¬ 
mately  0.1  and  0.08  eV  for  RG1  and  RG2  geometries,  re¬ 
spectively,  which  could  again  suggest  polaron  bindings  close 
to  0.1  eV. 

It  appears,  however,  strange  that  these  naive  estimates 
lead  to  relatively  small  differences  (and  even  to  a  reversal)  in 
polaron  bindings  for  RG1  and  RG2  geometries — while  the 
spatial  localization  has  been  found  much  more  significant  in 
RG2  (Sec.  Ill  A).  We  believe  this  indicates  that  the  actual 
polaron  binding  energies  would  be  underestimated  in  dis¬ 
cussed  energetic  comparisons.  It  is  quite  likely  that  £lUM0 
and  -Ehomo  do  not  1'ePresent  fully  the  lowest  extra  energy  of 
systems  where  an  excess  delocalized  electron  or  an  excess 
delocalized  hole  would  interact  with  a  nonuniform,  charge 
density  wave  (CDW),  background  of  the  polymer  chain. 

Bond-centric  CDWs  are  known  to  form  on  neutral 
chains  upon  their  dimerization  with  more  electronic  density 
corresponding  to  shorter  (C  =  C)  and  less  electronic  density 
to  longer  (C-C)  bonds.  Such  CDWs  have  been  widely  dis¬ 
cussed  in  the  context  of  CPs  as  Peierls  insulators.1  Being  a 
nonvanishing  distribution  of  the  electric  charge,  a  CDW  is 
expected  to  polarize  the  solvent,  which  is  indeed  how  we 
interpret  the  data  shown  in  Fig.  11.  The  figure  displays  the 
electrostatic  energy  [/pol  stored  in  the  solvent  polarization  as 
retrieved  from  outputs  of  GAUSSIAN  03  computations.  It  is 
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FIG.  11.  The  electrostatic  energy  stored  in  the  solvent  polarization  as  a 
function  of  the  number  N  of  carbons  in  CV1L.  Triangular  data  points  are  for 
neutral  C^H?,  squares  for  charged  CyHF  chains.  The  dashed  lines  connect 
the  data  for  RG1  chain  geometry,  the  solid  lines  for  RG2. 

evident  that  this  energy  does  not  vanish  for  neutral  chains 
and  in  fact  grows  linearly  with  the  number  of  carbons  N,  as 
one  would  expect  for  a  CDW.  Since  the  amplitude  of  CDW 
is  enhanced  in  neutral  chains  with  RG2  geometry,  they  do 
exhibit  a  steeper  growth  with  N.  It  would  be  interesting  to 
evaluate  the  interaction  of  a  CDW  with  the  solvent  using 
more  tractable  analytical  approaches  based  on  atomistic 
models  of  both  the  CP  chain  and  the  solvent.  (A  related  ex¬ 
ample  of  structure-based  calculations  is  given  in  Ref.  43 
treating  the  Coulomb  interaction  of  CDWs  on  neighboring 
polyacetylene  chains.)  That  would  allow  clarification  of  the 
geometrical  regimes  of  the  applicability  of  the  continuum 
solvent  description. 

Figure  1 1  also  shows  the  evolution  of  Upo i  for  negatively 
charged  CVHA  chains.  Especially  illuminating  here  are  the 
results  for  chains  with  RG2  geometry  exhibiting  a  nonmono¬ 
tonic  behavior  of  f/pol  as  a  function  of  N.  Upo \  first  decreases 
with  N  at  smaller  N,  reflecting  the  decrease  in  the  electric 
field  of  the  excess  charge  upon  its  spreading  over  longer 
oligomers.  f/pol  then  reaches  its  minimum  in  the  region  of  N 
corresponding  to  the  formation  of  the  self-consistent  local¬ 
ized  polaron  pattern.  With  a  further  increase  in  N,  Upol  starts 
growing  again  with  the  same  slope  that  is  exhibited  by  neu¬ 
tral  chains  reflecting  the  interaction  of  the  CDW  with  the 
solvent.  The  converged  large- A'  difference  in  Upo\  of  charged 
and  neutral  chains  with  RG2  geometry  is  evidently  quite 
substantial,  close  to  0.9  eV.  While  the  minimum  of  t/pol  is  not 


achieved  for  negative  chains  with  RG1  geometry  within  the 
range  of  chain  lengths  studied,  one  could  guess  from  the  data 
in  Fig.  11  that  the  large -A/  difference  in  t/pol  of  charged  and 
neutral  chains  in  that  case  would  probably  be  roughly  half  of 
the  value  for  RG2  geometry.  This  ratio  would  look  like  more 
in  line  with  charge  localization  patterns  observed  for  RG1 
and  RG2.  The  very  magnitude  of  the  “excess”  polarization 
energy  just  discussed  may  be  indicative  of  the  polaron  bind¬ 
ing  energies  appreciably  larger  than  0. 1  eV.  This  speculative 
conclusion  could  be  verified  more  directly  if  we  were  able  to 
perform  ab  initio  computations  for  charged  chains  with  the 
polarization  state  of  the  solvent  corresponding  to  that  for 
neutral  chains. 

IV.  SOLVATION  AND  BOND  ALTERNATION  PATTERNS 

We  now  turn  our  attention  to  the  results  obtained  with 
full  geometrical  optimizations  of  underlying  carbon  lattices. 
A  very  convenient  and  well-known1"6  -7  way  to  characterize 
dimerized  polymeric  structures  is  via  BLA  patterns 

<5„  =  (-i)"(4-U), 

where  ln  is  the  length  of  the  nth  carbon-carbon  bond.  In  the 
infinite  dimerized  neutral  structure,  this  pattern  would  be 
uniform,  that  is,  independent  of  the  spatial  bond  position  n. 

Figure  12  compares  BLAs  computed  for  neutral  and 
negatively  charged  oligomers  C40H2,  C80H2,  and  C10oH2. 
This  comparison  allows  one  to  visually  evaluate  the  magni¬ 
tude  of  oligomer  end  effects.  It  is  clear  that  a  uniform, 
infinite-polymer,  behavior  is  practically  achieved  over  a  long 
central-part  stretch  of  the  neutral  CI00H2.  There  is  very  little 
difference  in  BLAs  of  neutral  chains  in  vacuum  and  in  sol¬ 
vent;  a  small  increase  in  Sn  in  the  solvent  is  consistent  with 
our  expectation  for  a  CDW  interacting  with  the  environment. 

Much  more  significant  variations  are  observed  for 
charged  oligomers.  As  was  discussed  at  length  for  electron- 
lattice  polarons  in  CPs,  ’  the  magnitude  of  BLAs  would  ex¬ 
hibit  a  dip  in  the  spatial  region  containing  an  excess  particle, 
where  the  lattice  becomes  “less  dimerized.”  This  expected 
decrease  in  S„  is  evident  in  all  panels  of  Fig.  12.  The  char¬ 
acter  of  its  evolution  with  the  oligomer  length,  however,  is 
quite  different  for  chains  in  vacuum  and  in  solvent.  The 
vacuum  data  just  show  a  progressive  reduction  in  the  dip  in 
<5„  for  longer  oligomers.  In  the  absence  of  data  for  much 


FIG.  12.  Carbon-carbon  BLA  patterns  for  (a)  C40H2,  (b)  CSoH2,  and  (c)  CmoFL.  Patterns  for  neutral  chains  are  indicated  with  symbol  (0),  patterns  for 
negatively  charged  chains  with  (—1).  The  dashed  lines  show  results  for  chains  in  vacuum,  the  solid  lines  for  chains  in  the  solvent  environment. 
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FIG.  13.  Comparison  of  the  solvation  and  BLA  effects  for  CIOoH2.  (a)  Ex¬ 
cess  charge  distributions  in  vacuum  and  in  the  solvent  for  the  fully  opti¬ 
mized  lattice  geometry  (solid  lines)  and  for  the  RG1  geometry  (dashed 
lines),  (b)  The  corresponding  MO  energy  levels  in  the  solvent  for:  (bl) 
Optimized  neutral  chain,  (b2)  Charged  chain  in  RG1  geometry,  (b3)  Opti¬ 
mized  charged  chain.  The  arrows  show  HOMO  levels. 


longer  chains,  all  we  can  conclude  is  that  a  very  flat  pattern 
of  Sn  in  the  middle  of  CiqqHj  does  not  suggest  yet  an  incipi¬ 
ent  formation  of  a  self-localized  pattern  characteristic  of  po- 
larons.  This  is  reminiscent  of  the  apparent  failure  of  DFT 
computations  to  detect  electron-lattice  polarons  in  computa¬ 
tionally  tractable  oligomers  discussed  in  Sec.  I — with  our 
results  now  extended  to  hybrid  B3LYP  computations.  On  the 
contrary,  the  data  derived  for  chains  in  solvent  quite  clearly 
indicate  a  nearly  self-consistent  localized  pattern  of  Sn  on 
CkjqH,,  although,  of  course,  longer  oligomers  would  be 
needed  to  reveal  a  truly  self-consistent  polaron  size. 

These  observations  correlate  well  with  the  results  ob¬ 
tained  for  excess  charge  densities  on  C^H]  that  are  dis¬ 
played  in  panel  (a)  of  Fig.  13.  The  panel  compares  the  excess 
density  on  chains  with  the  uniform  BLA  of  neutral  chains 
(RG1  geometry)  and  on  chains  where  the  relaxation  of  the 
BLA  due  to  the  excess  charge  has  taken  place.  One  can  see 
that  while  the  relaxation  of  the  BLAs  does  lead  to  spatial 
contractions  of  charge  densities  both  in  vacuum  and  in  sol¬ 
vent,  the  delocalized  (over  the  whole  oligomer)  nature  of  the 
excess  charge  remains  intact  in  the  vacuum  case.  Excess 
charge  localization  due  to  solvation,  on  the  other  hand,  gets 
evidently  enhanced  by  the  localized  lattice  response  seen  in 
Fig.  12(c).  One  could  probably  say  that  solvation  and  lattice 
relaxation  here  act  synergistically  to  affect  excess  charge 
self-trapping.  With  our  B3LYP  computations,  the  effect  of 
the  solvation  on  the  excess  charge  distribution  is  evidently 
much  more  pronounced.  A  complementary  information  is 
provided  by  panel  (b)  of  Fig.  13,  showing  the  effects  of 
solvation  and  BLA  relaxation  on  the  electronic  structure  of 
MO  levels.  Particularly  relevant  here  are  the  intragap  levels. 
Interestingly,  for  the  energies  of  these  levels  effects  due  to 
each  of  the  mechanisms  appear  similar  in  magnitude.  This 
suggests  that  one  has  to  be  careful  in  evaluating  self¬ 
localization  per  se  based  just  on  the  appearance  of  the  MO 
levels  in  their  overall  structure.  The  spatial  behavior  of  indi¬ 
vidual  MOs,  as  well  as  the  total  spatial  distribution  of  the 
excess  charge,  needs  to  be  properly  analyzed. 


FIG.  14.  Results  of  restricted-HF  computations  for  C40H2  oligomers,  (a) 
Excess  charge  density  distribution  on  C40H2  in  vacuum  and  in  solvent. 
Dashed  lines  show  the  distribution  on  chains  with  a  uniform  BLA  pattern  of 
the  neutral  system,  solid  lines  are  for  chains  with  relaxed  (optimal)  geom¬ 
etries.  (b)  Optimal  BLA  patterns  for  neutral  (0)  and  negatively  charged  (— 1) 
oligomers.  The  dashed  lines  are  for  chains  in  vacuum,  the  solid  lines  are  for 
chains  in  solvent. 


A.  Hartree-Fock  results 

As  mentioned  in  Sec.  I,  in  this  paper  we  do  not  pursue  an 
extensive  comparison  of  results  obtained  with  various  ab  ini¬ 
tio  methods.  To  emphasize  the  level-of-theory  dependence  of 
at  least  the  quantitative  aspects,  we,  however,  find  it  perti¬ 
nent  to  provide  a  glimpse  at  the  nature  of  results  derived  with 
HF  computations.  HF  calculations  for  conjugated  polymeric 
structures  have  been  discussed26"7  to  result  both  in  overesti¬ 
mated  energy  gaps  and  magnitudes  of  BLAs.  At  the  same 
time,  unlike  pure-DFT  computations,  they  have  also  been 
shown  to  yield  self-localized  polaronic  states  for  excess 
charge  carriers.  “  Our  findings  for  polyynic  chains  agree 
well  with  these  known  trends. 

Figure  14  illustrates  our  restricted-HF  results  for  neutral 
and  negatively  charged  C40H2  chains  both  in  vacuum  and  in 
the  solvent  environment;  they  are  consistent  with  a  limited 
set  of  data  we  have  also  computed  for  longer  chains.  When 
compared  to  B3LYP-derived  results  discussed  earlier  in  this 
paper  and  in  Ref.  15,  one  immediately  notices  an  appreciably 
higher  degree  of  excess  charge  density  localization  and 
larger  magnitudes  of  BLAs  obtained  with  the  HF.  Reassur¬ 
ingly,  the  qualitative  effect  of  the  solvent  environment  re¬ 
mains  intact:  it  does  enhance  the  excess  charge  localization 
[panel  (a)  of  Fig.  14]  and  leads  to  a  shorter  and  more  pro¬ 
nounced  associated  dip  in  the  optimized  BLA  pattern  [panel 
(b)].  Differently  from  the  B3LYP-data,  however,  solvation 
here  does  not  appear  to  act  as  a  “primary”  source. 

In  agreement  with  previous  HF  studies,  we  do  find  the 
excess  charge  self-localized  already  on  chains  in  vacuum. 
One  might  be  tempted  to  say — as  expected  for  traditional 
electron-lattice  polarons — that  the  calculated  self-trapping  in 
this  case  is  caused  by  the  interaction  with  displacements  of 
carbon  atoms  manifested  in  the  relaxed  BLA  pattern  in  Fig. 
14(b).  The  comparison  made  in  Fig.  14(a),  however,  does  not 
seem  to  support  such  a  viewpoint.  Specifically,  we  compare 
in  that  panel  the  excess  charge  densities  obtained  for  fully 
relaxed  geometries  and  for  a  rigid  geometry  featuring  a  uni¬ 
form  BLA  pattern.  The  latter  corresponds  to  the  BLA  derived 
for  neutral  chains  in  their  middle  portion  [Fig.  14(b)].  De¬ 
spite  the  uniform  lattice  background,  the  excess  charge  local- 
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ization  evidently  takes  place,  and  with  minimal  variations 
from  the  fully  geometrically  relaxed  chains.  Moreover,  the 
lattice  relaxation,  while  indeed  decreasing  the  total  system 
energy,  counterintuitively  results  here  in  slightly  smaller  lo¬ 
calization.  (To  compare,  in  B3LYP  results  geometrical  relax¬ 
ation  would  always  lead  to  larger  spatial  charge  localization.) 
Thus,  it  is  rather  the  treatment  of  electron-electron  interac¬ 
tions  within  the  framework  of  HF  computations  that  appears 
to  affect  the  excess  charge  localization  observed  in  our  HF 
data.  In  this  paper,  we  forgo  a  discussion  of  the  physical 
validity  of  these  results  that  would  require  a  more  focused 
study. 

V.  DISCUSSION 

Self-localization  of  excess  charge  carriers  (electrons  or 
holes)  into  polaronic  states  on  ID  SC  structures  is  an  inter¬ 
esting  effect  that  may  be  consequential  for  transport,  optical, 
and  electrochemical  properties  of  these  systems.  The  basic 
nature  of  the  self-trapping  is  easily  revealed  in  a  single¬ 
particle  picture  within  a  standard  ID  continuum  adiabatic 
framework,  where  a  localized  carrier  wave  function  ip{x)  is 
found  as  corresponding  to  the  ground-state  solution  of  the 
nonlinear  Schrodinger  equation, 

h 2  cPiMx)  f  , 

- Tl- -  dx,V(x-xf)\i/^x')\2i/^x)  =  Eil^x). 

2m  dx  J 

Here  x  is  the  coordinate  along  the  structure  axis,  m  is  the 
effective  band  mass  of  the  carrier,  and  V(x)  is  the  effective 
self-interaction  mediated  by  another  subsystem.  In  the  case 
of  a  short-range  electron-phonon  mediation,  the  self¬ 
interaction  may  be  approximated  as  local:  F(x)=g<5(.>c)  lead¬ 
ing  then  to  the  exact  solution  i/^x)ccsech(gmxl2h2)  for  a 
continuum  ID  polaron.  This  is  the  result  widely  known  after 
the  pioneering  contributions  of  Rashba44  and  Holstein. Jl  An¬ 
other  type  of  mediation  arises  due  to  the  long-range  polar¬ 
ization  interaction  of  an  excess  charge  carrier  with  a  sur¬ 
rounding  polar  medium.  In  that  situation,  the  effective  V(x) 
behaves  as  l/|x|  at  large  distances,  while  the  short-range 
behavior  depends  on  specific  system  details  such  as,  e.g.,  the 
actual  transverse  charge  density  distribution  and  the  geom¬ 
etry  of  the  dielectric  screening.3'4'7  Numerical  studies  of  this 
case4'3  indicate  that  the  binding  energy  of  the  resulting  po- 
larons  could  reach  an  appreciable  fraction  of  the  binding  en¬ 
ergy  of  the  corresponding  primary  optical  excitations, 
Wannier-Mott  excitons. 

While  illuminating  the  basic  physics  of  the  self-trapping, 
simplified  single-particle  descriptions  have  the  drawback  of 
not  explicitly  including  valence  band  electrons,  whose  inter¬ 
actions  and  reorganization  may  substantially  affect  the  out¬ 
comes.  Numerous  ab  initio  calculations  have  been  dedicated 
to  effects  of  electron-electron  interactions  on  electron-lattice 
polarons  in  CPs.  In  this  paper  we  have  extended  our  first 
model  ab  initio  study13  intended  to  address  these  effects  in 
the  formation  of  ID  polarons  due  to  the  interaction  with  a 
surrounding  polar  solvent.  As  our  study  is  performed  within 
the  framework  of  the  PCM,  it  could  also  be  thought  of  as  a 
new  avenue  for  applications  of  quantum-mechanical  con¬ 
tinuum  solvation  models.17'18 


By  studying  and  comparing  various  model  systems 
based  on  long  CaFC  oligomers  with  the  polyynic  structure, 
we  have  been  able  to  successfully  demonstrate  the  self- 
consistent  formation  of  ID  large  electron  and  hole  polarons 
entirely  due  to  solvation.  Reorganization  of  valence  band 
electrons  and  a  more  complex  many-electron  structure  of  the 
resulting  polarons  have  been  also  explicitly  illustrated.  Full 
geometrical  optimization  of  the  underlying  carbon  lattice  has 
shown  that  localized  BLA  patterns  may  act  synergistically 
with  the  solvent  reorganization  to  even  further  increase  the 
degree  of  excess  charge  self-localization. 

It  should  be  reiterated  that,  while  providing  clear  con¬ 
ceptual  demonstrations  and  a  seemingly  appealing  physical 
picture  of  solvation-induced  self-localization,  our  current 
study  is  limited  both  in  terms  of  its  scope  and  direct  quanti¬ 
tative  applicability  to  real  systems.  Among  other  things,  we 
have  already  discussed  and  illustrated  a  relatively  strong  de¬ 
pendence  of  the  results  on  the  level  of  ab  initio  theory  used, 
a  common  feature  of  such  computations. “6  _7  DFT  computa¬ 
tions  with  hybrid  exchange-correlation  functionals  may  be 
“appropriate”  engines  for  the  problem  at  hand,  but  a  more 
comprehensive  comparative  study  is  required  before  conclu¬ 
sions  can  be  drawn.  The  PCM  framework  as  used  in  this 
paper  may  overestimate  the  effects  of  solvation  on  self¬ 
localization  as  it  does  not  take  a  proper  account  of  the  fre¬ 
quency  dependence  of  the  actual  solvent  dielectric 
functions.4'3'8'11  This  issue  could  possibly  be  analyzed  within 
newly  developed  time-dependent  PCM  schemes.45'46  As  the 
computational  demand  for  calculations  of  ID  large  polarons, 
especially  in  application  to  more  complex  SC  structures,  may 
turn  out  to  be  excessive,  one  should  also  be  looking  for  more 
efficient  computational  approaches  such  as,  e.g.,  the  scaling 
description  of  solvent  effects.47 
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SPRING  PROGRESS  REPORT: 

Novel  Proton  Conducting  Membranes  for  PEM  Fuel  Cells 
Summary 

We  report  that  we  have  made  promising  lead  in  developing  polymeric  membranes  operable  as 
proton  exchange  membrane  (PEM)  to  meet  the  DOE  guidelines.  The  successful  PEM  must  have 
good  proton  conductivity  with  mechanical,  thermal,  and  chemical  stability  under  fuel  cell 
operation  over  a  wide  range  of  temperature  (-20  to  120  °C)  and  relative  humidity  (25  to  100%) 
conditions.  To  meet  these  requirements,  we  have  designed  novel  polymers  with  three  different 
design  concepts  (see  Strategies  1-3  as  shown  below).  Since  then,  we  have  already  prepared 
several  polymers,  and  we  now  are  in  the  process  of  optimizing  these  to  fabricate  into  new  proton 
exchange  membranes. 

At  present,  highest  proton  conductivity  value  is  up  to  0.07  S/cm  at  80°C  and  90%  RH  with  the 
polymers  made  from  the  Strategy  f .  The  polymers  showed  good  thermal  stability  even  at  200  °C. 
Tests  at  various  conditions  with  further  optimization  of  the  polymers  are  underway  to  find  if  the 
polymers  can  perform  to  meet  with  the  DOE  guideline  of  the  conductivity  >0.1  S/cm  at  120°C 
and  <  25%  RH.  Additionally,  synthesized  polymers  from  the  Strategies  2  and  3  have  also  been 
characterized,  and  testing  them  under  fuel  cell  conditions  will  be  started  soon. 

Background 

Nafioiv  is  the  current  dominant  material  for  PEM  fuel  cells  and,  typically,  operates  at  low 
temperature  (<80°C)  and  high  relative  humidity  (R.H.).  However,  it  is  unsuitable  for  operation 
at  higher  temperatures  (>120°C)  and/or  low  R.H.  conditions  because  of  significant  loss  in  proton 
conductivity  due  to  dehydration,  compromised  mechanical  strength  due  to  its  relatively  low  glass 
transition  temperature  (Tg)  in  the  dry  state  (~1 10  °C),  and  even  lower  in  the  hydrated  state,1  and 
susceptibility  to  chemical  degradation  by  peroxide-derived  free  radicals  formed  under  FC 
operation.  The  present  DOE’s  year  2010  guidelines  for  PEM  applications  include  proton 
conductivity  >0.1  S/cm  and  durability  for  2-5,000  hrs  at  120°C  /  <25%  RH,2  and  Nafion1'  fall 
short  to  these  required  properties. 

Our  PEM  designs  involve  the  use  of  copolymers  comprising  hydrophobic  and  hydrophilic 
domains.  The  combination  of  the  hydrophilic  and  hydrophobic  segments  will  result  in  the 
incorporation  of  the  desired  aggregation  of  the  rigid  hydrophobic  backbone  for  mechanical 
stability  and  minimal  water  swelling,  and  the  high  concentration  of  acidic  functionalities  for 
better  proton  conductivity.  The  use  of  backbones  with  glass  transition  temperatures  (Tg)  >120 
°C  will  also  enhance  membrane  stability  at  high  temperatures.  The  hydrophobic  block  will 
consist  of  perfluorocyclobutane  (PFCB)  aromatic  ether  linkages  formed  by  thermal  cyclization 
of  triflu orovinyloxybenzene  (TFVOB)  groups  without  side  products.  The  linear  prepolymers  or 
block  segments  can  be  casted  into  membranes  from  common  organic  solvents,  and  the  resultant 
membranes  can  be  further  thermally  treated  to  obtain  high  molecular  weight  copolymers  or  a  3-D 
cross-linked  network.  These  highly  processable  PFCB  polymers  have  been  reported  to  be 


thermally,  mechanically  and  oxidatively  robust  with  Tg’s  >200  °C. 3,4,5  This  combination  of 
properties  makes  them  promising  for  the  hydrophobic  backbone  in  the  proposed  copolymers. 

Hydrophilic  side  chains  will  contain  highly  acidic  functionalities  such  as  sulfonimides  and 
sulfonic  acids,  thereby  maximizing  IECs  (greater  than  1  mmol/g)  for  the  copolymers.  The  side 
chains  will  be  flexible,  facilitating  aqueous  micelle  fonnation  without  disrupting  the  hydrophobic 
domains.  (Per)  fluorinated  side  chains  will  also  be  used  to  enhance  the  overall  stability  under 
fuel  cell  operating  conditions. 

Strategy  1:  Copolymers  with  hydrophobic  PFCB-based  backbones  and  highly  acidic  side 
chains 

The  general  structure  of  the  block  copolymer,  consisting  of  hydrophobic  and  hydrophilic 
segments,  is  shown  in  Fig.  1.  The  proposed  hydrophobic  rigid  backbone  was  derived  from  the 
TFVO  derivatives  shown  in  Fig.  2. 


The  salt  form  of  Zi  in  the  perfluorinated  side  chain  will  be  converted  into  the  acid  fonn  by 
hydrolysis  in  aqueous  acid  after  membrane  fabrication,  giving  two  proton  sources  for  each  side 
chain,  which  will  lead  to  IEC  values  up  to  2  mmol/g,  a  two-fold  increase  vs.  Nation  .  The  high 
density  of  the  acid  sites  in  the  membrane  will  enhance  the  water  uptake  ability  and,  therefore, 
proton  conductivity  at  high  temperature  and  low  humidity  conditions. 

For  PFSA  polymers  such  as  Nation  ',  and  sulfonated  polyarylenes,  IEC  increases  are  typically 
achieved  at  the  expense  of  mechanical  stability,  resulting  from  polymer  swelling.  In  contrast,  our 
design,  which  utilizes  both  rigid  hydrophobic  backbones  and  highly  acidic  hydrophilic  side 
chains,  will  allow  increased  values  of  IECs  compared  to  Nation®  with  minimal  swelling. 


Strategy  2:  Copolymers  with  hydrophobic  PFCB-based  backbones  and  hydrophilic  side 
chains  containing  N-heterocycles  and  acidic  functional  groups 


F 

\  Z^F 

A 

F 

l 

7 

Uf 

A  1  r 

/v  / 

T, 

Heterocycle 
Heterocycle: 


Acid  group: 


N .  NH 
N 


O  O 

X 


R 


o  q 


Acid  group 

av 


OH 


\-OH 

OH 


R:  O,  C=0,  0=S=0 

Figure  4.  Proposed  copolymers  prepared  from 
aryl  trifluoro  vinyl  ether  monomers  N- 
containing  heterocycles  or  acidic  groups. 


R2=  Br,  TsO,  S02CI,  COCI 
R3=  Br,  COOH,  O — CH2— = 

Figure  5.  Proposed  scheme  for  synthesis  of 
aryl  TFVE  monomers  (Ri  is  the  precursor  of 
R)  and  oligo(oxyethylene)  containing  reactive 
groups. 


Operation  of  a  PEMFC  under  essentially 
water-free  conditions  involves  a  different 
mechanism  for  proton  conduction 
(Grotthuss)6’7’8  where  protons  hop  from 
donor  to  acceptor  sites  without  the  transport 
of  associated  water  molecules.  A  number  of 
oligomeric  and  polymeric  systems  with 
covalently  attached  heterocycles  have 
recently  shown  promise  under  essentially 
anhydrous  conditions.  "  We  have  extended 
this  concept  to  copolymers  with  PFCB- 
based  backbones  containing  flexible  side 
chains  terminating  in  heterocycles,  and 
acidic  functionalities  in  the  other,  as  shown 
in  Fig.  4. 

Tethering  the  functional  groups  on  the  side 
chains  allows  for  high  local  mobility.  We 
have  initially  explored  polyether  side  chains 
of  various  lengths  containing  a  number  of 
heterocyclic  and  acid  end  groups  to 
maximize  the  proton  conductivity.  The  meta 
bis- 1,3 -trifluoro vinyl oxybenzene  (I)  in  Fig  5 
contains  a  reactive  functional  group  Ri  that 
is  used  to  attach  a  side  chain  to  the  polymer 
backbone.  The  precursor  of  this  side  chain 
has  two  reactive  functional  groups  (R2  and 
R3)  (Fig.  5).  One  will  be  used  to  attach  the 
side  chain  to  the  backbone,  while  the  other 
will  be  used  to  attach  either  the  heterocyclic 
or  acid  group.  The  reactive  functional 
groups  may  be  acyl  chloride  (COCI  or 
SO2CI),  halogen  (Br  or  Cl),  hydroxy  (OH), 
carboxyl  (COOH),  or  alkynyl  (C=C). 


Strategy  3:  Copolymers  with  hydrophobic  perfluoro  aromatic  ether-based  backbones  and 
highly  sulfonated  group 


We  designed  another  structure  of  copolymer  containing  hydrophilic  and  hydrophobic  domains 
for  developing  properties  of  polymer  electrolyte  membrane  to  meet  the  operation  conditions  as 
suggested  in  DOE  guideline.  We  prepared  the  rigid-rod  type  polymer  as  biphenyl  ether  and 
phenyl  sulfone.  The  polymer  contains  two  side  chains  (Figure  6)  that  could  induce  the  formation 
of  liquid  crystalline  property.  This  property  can  form  nanos  tract  tired  proton  conducting  channels. 
We  can  also  easily  control  the  channel  length  by  changing  aliphatic  chain  length. 


Experimental  Procedure:  Results  and  Discussion 
Progress  on  Strategy  1-3: 


Strategy  1:  Copolymers  with  hydrophobic  PFCB-based  backbones  and  hydrophilic  side 
chains  containing  pendant  acidic  functional  groups 

Polysulfone  containing  at  least  three  sulfonimide  acid  groups 

Using  the  chemistry  derived  from  the  functionalization  of  commercially-available  polymer, 
Udel®’s  polysulfone  (PSU),  the  hydrophilic  segments  were  prepared.  Nonetheless,  the  PSU- 
based  membrane  was  characterized  and  tested  for  proton  conductivity. 

Synthesis  route  and  digital  image  of  the  PSU -based  membrane 


PSU  membrane  cast  from  TI II 

Figure  7.  Synthetic  route  to  PSU-functionalized  PEM; 

Digital  images  of  PSU-functionalized  PEM 


Thermal  gravimetric  analysis  (TGA)  of  the  PSU-based  membrane  shows  that  the  membrane  is 
stable  up  to  200  °C,  even  in  the  acidified  form.  The  acidification  process  involves  refluxing  the 
membrane  in  1.5  N  H2SO4,  washing  with  distilled  water,  and  drying  between  sheets  of  task 
wipers  at  room  temperature  overnight. 


Preliminary  proton  conductivity  testing  of  the  PSU-based  polymer  shows  promising 
conductivities  up  to  0.07  S/cm  at  80°C  and  90%  RH  as  shown  below.  Current  investigation  and 
optimization  of  the  polymer  synthesis  is  underway. 


Figure  8.  Proton  conductivity  measurements  for  PSU-functionalized  PEM  at  various 
Temp  and  RH  conditions 


Perfluorocyclobutane  (PFCB)  polymers  containing  sulfonimide  acid  groups 


In  the  synthesis  of  the  PFCB  polymers,  two  routes  were  used:  pre-polymer  functionalization  and 
post-polymerization  chemistry. 

Via  monomer  functionalization  prior  to  polymerization:  This  route  was  optimized  to  5  steps, 
instead  of  the  original  planned  route  shown  below. 
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Figure  9.  General  route  for  monomer  functionalization  prior  to  polymerization 


Figure  10.  Optimized  route  for  monomer  functionalization  prior  to  polymerization 

The  hydrophilic  segment  has  been  prepared,  cast  and  is  currently  being  tested  and 
characterized  by  spectroscopy  techniques,  thermal/chemical  stability  and  proton 
conductivity  testing.  Future  work  includes  combination  of  the  said  hydrophilic  segment 
with  the  hydrophobic  segments  synthesized  similarly  (steps  1,  2  and  7  in  Optimized 
Route)  to  form  block  copolymers  and  blends. 

Via  yost-sulfonation  ofPFCB  via  lithiation:  Another  route  is  post-sulfonation  on  the  PFCB 
polymer.  Two  hydrophilic  segments  have  been  by  post-sulfonation,  the  sulfonyl  biphenyl  and 
the  biphenyl  PFCB.  The  following  routes  are  presented. 


Post-sulfonation  of  Polymer  A 


Figure  11.  Post-sulfonation  via  lithiation  procedures 


Post-sulfonation  of  Polymer  B 

1 

:  1)  KOH,  DMSO,  Toluene,  4  d,  120°C 

HQ — ^  ^ QH  2)  BrCF2Cf2Br ,  2d,  25-50°C  ^  BrF2CF2CO— OCF2CF2Br 

;  2  Zn,  anhydrous  acetonitrile 


1)  Chlorosulfonic  acid  5  h  Monomer  B 

2)  Thionyl  chloride  in  CH2C12,  2  d 


f3co2shno2s 


Figure  12.  Post-sulfonation  via  chlorosulfonation 


Both  polymers  have  been  functionalized  and  are  currently  being  characterized  and  tested 
for  fuel  cell  measurements  and  performance. 


General  Synthetic  Procedure 

Trifluorovinyloxybenzene  Monomer  Preparation 

Monomers  A  and  B  were  synthesized  based  on  an  adapted  published  procedure.  In  a  500-irrL  3- 
neck  round  bottom  flask  fitted  with  a  Dean-Stark  apparatus  and  reflux  condenser,  25.00  g  of 
phenolic  starting  material,  2.2  molar  equivalents  of  KOH  and  1:4  v/v  ratio  of  dimethylsulfoxide 
and  toluene  were  combined  under  N2.  The  solution  was  boiled  under  reflux  120  °C  for  3-4  d,  or 
until  no  water  is  distilled  out  with  toluene  in  the  Dean-Stark  apparatus  arm.  After  the  reflux,  the 
remaining  toluene  was  removed.  The  reaction  mixture  was  allowed  to  cool  down  to  room 
temperature,  and  further  to  15-20  °C  for  the  addition  of  the  alkylation  reagent,  1,2- 
dibromotetrafluoroethane  (BrCF2CF2Br)  under  N2.  Three  molar  equivalents  of  BrCF2CF2Br  was 
added  dropwise  to  the  reaction  mixture  over  at  least  1.5  h  at  less  than  20  °C.  After  the  complete 
addition  of  the  alkylating  agent,  the  reaction  mixture  was  stirred  at  room  temperature  for  12  h, 
and  at  50  °C  for  36  h.  The  reaction  was  stopped  by  adding  100  irrL  of  water.  After  stirring  for 
15  min,  the  organic  and  aqueous  layers  were  separated.  The  aqueous  layer  was  extracted  with 
dichloromethane  three  times  (100  mL  each).  The  combined  organic  layers  were  washed  with 
water  followed  by  a  saturated  brine  solution.  The  resulting  organic  layer  was  dried  using 
anhydrous  Na2SC>4.  Purification  of  the  crude  product  was  performed  using  column 
chromatography  with  silica  as  the  stationary  phase,  and  ethyl  acetate/hexane  as  solvent  system. 

The  alkylated  compounds  were  then  dehalogenated  using  the  following  procedure:  In  a  250-mL 
round  bottom  flask,  the  alkylated  compound,  8  molar  equivalents  of  freshly  activated  Zn,  and  dry 
acetonitrile  were  added  under  N2.  The  reaction  mixture  was  boiled  under  reflux  for  3-4  d. 
Acetonitrile  was  then  removed  by  vacuum  distillation,  leaving  the  crude  product,  salts  and  Zn. 
The  crude  product  was  extracted  using  hexanes,  and  the  other  solids  separated  by  filtration.  The 
filtrate  was  concentrated,  and  the  crude  product  purified  using  column  chromatography  on  silica 
and  ethyl  acetate/hexane  as  solvent  system  to  yield  the  monomer  desired. 

Polymerization 


Three  grams  of  the  monomer  was  placed  in  a  25-mL  round  bottom  flask.  With  the  heating  oil 
bath  at  >185  °C,  the  flask  was  immersed,  ensuring  melting  of  the  polymer.  The  temperature  was 
raised  to  the  desired  temperature,  and  maintained  for  4-5  d.  The  resulting  polymer  was  dissolved 
in  chloroform,  and  precipitated  out  using  methanol.  A  light  brown  polymer  was  obtained  and 
dried  under  vacuum  for  12  h  at  >  50  °C. 

Strategy  2:  Copolymers  with  hydrophobic  PFCB-based  backbones  and  hydrophilic  side 
chains  containing  N-heterocycles  and  acidic  functional  groups 

We  have  prepared  a  polymer  consisting  of  six-carbon  polyether  side  chains  containing  a 
number  of  heterocyclic  (1,  2,  3-triazole)  and  acid  (sulfonic  and  phosphonic  acid)  end  groups. 
The  meta  bis- 1 ,  3 -trifluorovinyloxybenzene  was  used  to  attach  the  polyether  side  chain  to  the 


polymer  backbone.  At  this  point,  optimization  for  the  synthetic  routes  is  underway,  and  at  the 
same  time,  a  proton  exchange  membrane  is  ready  to  be  cast  and  characterized  by  proton 
conductivity,  ion-exchange  determination  (IEC),  tensile  strength,  thermal  and  chemical  stability. 


Synthesis  and  characterization  of  comb-shaped  perfluorocyclobutyl  aromatic  polyethers 
bearing  adjustable  phosphonated  pendant  chain 
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General  procedure  for  synthesis  of  compound  5 
A  solution  of  2  (4.84  g,  10  mmol),  4  (10  mmol)  and  K2CO3  (2.07  g,  15  mmol)  in  CH3CN  (30 
mL)  was  stirred  at  reflux  for  48  h.  After  solvent  was  removed,  H2O  (100  mL)  and  EtOAc  (20 
inL)  were  added.  The  aqueous  phase  was  separated  and  extracted  with  EtOAc  (2x10  mL).  The 
combined  organic  phases  were  washed  with  H2O  (2  x  10  mL),  brine  (2  x  10  mL),  dried  over 
anhydrous  MgS04.  After  removal  of  the  solvent,  the  residue  was  purified  by  silica  column 
chromatography  using  hexane-acetone  (1:1,  v/v)  to  give  a  colorless  oil  (5). 

Compound  5a:  88%  yield;  'H  NMR  (CDCI3,  ppm)  8  6.69  (d,  3H),  3.95-4.12  (m,  6H),  3.68-3.83 
(m,  12H),  1.96-2.22  (m,  2H),  1.24  (t,  6  H).  13C  NMR  (CDC13,  ppm)  5  160.5,  149.8,  115.2  (tt, 
1C),  113.2  (tt,  1C),  107.6,  106.9,  70.6,  70.1,  69.2,  68.5,  61.7  (d,  1C),  28.7,  26.2,  16.3  (d,  1C).  19F 
NMR  (CDCI3,  ppm)  5  -  67.9  (t,  IF),  -  86.0  (t,  IF).  31P  NMR  (CDCI3,  ppm)  5  26.4.  IR  (neat,  cm" 
')  3098,  2983,  2879,  1605,  1457,  1324,  1102,  793.  Compound  5b:  86%  yield;  *H  NMR  (CDC13, 
ppm)  5  6.69  (d,  3H),  3.95-4.12  (m,  6H),  3.68-3.83  (m,  12H),  1.96-2.22  (m,  2H),  1.24  (t,  6  H). 
13C  NMR  (CDCI3,  ppm)  5  160.3,  149.7,  115.5  (tt,  1C),  113.5  (tt,  1C),  107.7,  107.1,  70.8,  70.2, 
69.4,  68.3,  61.6  (d,  1C),  28.8,  26.0,  16.4  (d,  1C).  19F  NMR  (CDC13,  ppm)  5  -  67.9  (t,  IF),  -  86.0 
(t,  IF).  31P  NMR  (CDCI3,  ppm)  5  26.2.  IR  (neat,  cm  1)  3098,  2983,  2879,  1605,  1457,  1324, 
1 102,  793.  Compound  5c:  84%  yield;  !H  NMR  (CDCI3,  ppm)  8  6.69  (d,  3H),  3.95-4.12  (m,  6H), 
3.68-3.83  (m,  12H),  1.96-2.22  (m,  2H),  1.24  (t,  6  H).  13C  NMR  (CDC13,  ppm)  8  160.1,  149.5, 
115.4  (tt,  1C),  113.3  (tt,  1C),  107.4,  106.9,  70.7,  70.3,  69.2,  68.1,  61.4  (d,  1C),  28.7,  25.9,  16.2 
(d,  1C).  19F  NMR  (CDCI3,  ppm)  8  -  67.9  (t,  IF),  -  86.0  (t,  IF).  31P  NMR  (CDC13,  ppm)  8  26.3. 
IR  (neat,  cm-1)  3098,  2983,  2879,  1605,  1457,  1324,  1 102,  793. 

1.1.5.  General  procedure  for  synthesis  of  compound  6 

A  solution  of  5  (5  mmol)  and  Zn  (0.975  g,  15  mmol)  in  CH3CN  (20  mL)  was  stirred  at  reflux 
for  48  h.  After  the  solvent  was  removed,  the  residue  was  diluted  with  CFLCL  and  filtered.  The 
filtration  was  washed  with  FLO  (2x10  mL),  brine  (2x10  mL),  dried  over  anhydrous  MgSCL. 
After  removal  of  the  solvent,  the  residue  was  purified  by  silica  column  chromatography  using 
hexane-acetone  (1:1,  v/v)  to  give  a  colorless  oil  (6). 

General  procedure  for  synthesis  of polymer  7 

Compound  6  (2.69  g,  5  mmol)  was  added  to  a  pre-dried  25  mL  flask.  After  three  cycles  of 
freeze-pump-thaw,  the  flask  was  heated  at  180  °C  for  3  d  and  200  °C  for  12  h.  After  cooling  to 
RT,  the  residue  was  dissolved  in  CHCI3  (10  mL).  The  resulting  solution  was  precipitated  in 
hexane  to  give  polymer  7. 

Polymer  7a:  94  %  yield;  !H  NMR  (DMSO-d6,  ppm)  8  6.98  (s,  3H),  3.75-4.25  (d,  6H),  3.30-3.75 
(d,  4H),  1.82-1.93  (m,  2H),  1.22  (s,  6  H).  13C  NMR  (DMSO-d6,  ppm)  8  161.2,  156.7,  117.0, 
115.0,  107.0  105.0,  99.0,  96.8,  70.7,  70.1,  69.4,  68.1,  65.2,  61.6  (d,  1C),  28.0,  26.0.  19F  NMR 
(DMSO-de,  ppm)  8  -  126.7,  -  127.2,  -  127.9,  -  128.5,  -  129.4,  -  130.0,  -  130.7,  -  131.3,  - 
132.1.  31P  NMR  (DMSO-de,  ppm)  8  25.6.  IR  (KBr,  cm"1)  3098,  2877,  1604,  1482,  1271,  1158, 
1031,988,961,792. 

Polymer  7b:  95  %  yield;  *H  NMR  (DMSO-d6,  ppm)  8  6.98  (s,  3H),  3.75-4.25  (d,  6H),  3.30-3.75 
(d,  8H),  1.82-1.93  (m,  2H),  1.22  (s,  6  H).  13C  NMR  (DMSO-d6,  ppm)  8  161.2,  156.7,  117.0, 
115.0,  107.0  105.0,  99.0,  96.8,  70.7,  70.1,  69.4,  68.1,  65.2,  61.6  (d,  1C),  28.0,  26.0.  19F  NMR 
(DMSO-de,  ppm)  8  -  126.7,  -  127.2,  -  127.9,  -  128.5,  -  129.4,  -  130.0,  -  130.7,  -  131.3,  - 


132.1.  31P  NMR  (DMSO-de,  ppm)  5  25.4.  IR  (KBr,  cm-1)  3098,  2877,  1602,  1482,  1271,  1158, 
1031,988,961,792. 

Polymer  7c:  92  %  yield;  *H  NMR  (DMSO-d6,  ppm)  5  6.98  (s,  3H),  3.75-4.25  (d,  6H),  3.30-3.75 
(d,  12H),  1.82-1.93  (m,  2H),  1.22  (s,  6  H).  13C  NMR  (DMSO-d6,  ppm)  5  161.2,  156.7,  117.0, 
115.0,  107.0  105.0,  99.0,  96.8,  70.7,  70.1,  69.4,  68.1,  65.2,  61.6  (d,  1C),  28.0,  26.0.  19F  NMR 
(DMSO-d6,  ppm)  5  -  126.7,  -  127.2,  -  127.9,  -  128.5,  -  129.4,  -  130.0,  -  130.7,  -  131.3,  - 
132.1.  31P  NMR  (DMSO-de,  ppm)  5  25.3.  IR  (KBr,  cm-1)  3098,  2877,  1606,  1482,  1271,  1158, 
1031,988,961,792. 

General  procedure  for  synthesis  of polymer  8 

To  a  solution  of  polymer  7  (2  g)  in  CHCI3  (50  mL)  was  added  bromotrimethylsilane  (10  mL). 
The  resulting  solution  was  stirred  at  RT  for  12  h  and  40  °C  for  24  h.  After  removal  of  any 
volatile,  the  residue  was  dissolved  in  methanol  (50  mL)  and  then  stirred  at  RT  for  12  h.  The 
polymer  was  subsequently  precipitated  in  50  mL  of  water  containing  5  vol  %  fuming 
hydrochloric  acid.  After  filtration,  the  polymer  was  dried  at  100  °C  for  24  h. 

Polymer  8a:  89  %  yield;  'H  NMR  (DMSO-d6,  ppm)  5  6.98  (s,  3H),  4.01-4.26  (s,  2H),  3.26-3.78 
(d,  4H),  1.82-1.93  (m,  2H).  13C  NMR  (DMSO-d6,  ppm)  5  161.2,  156.7,  117.0,  115.0,  107.0 
105.0,  99.0,  96.8,  70.7,  70.1,  69.4,  68.1,  61.6  (d,  1C),  28.0,  26.0.  19F  NMR  (DMSO-d6,  ppm)  5  - 

126.7,  -  127.2,  -  127.9,  -  128.5,  -  129.4,  -  130.0,  -  130.7,  -  131.3,  -  132.1.  31P  NMR  (DMSO- 
de,  ppm)  8  25.7.  IR  (KBr,  cm ')  3465,  3098,  2877,  1604,  1482,  1271,  1158,  1031,988,961,792. 
Polymer  8b:  91  %  yield;  NMR  (DMSO-d6,  ppm)  5  6.98  (s,  3H),  3.98-4.26  (s,  2H),  3.26-3.78 
(d,  8H),  1.82-1.93  (m,  2H).  13C  NMR  (DMSO-d6,  ppm)  5  161.2,  156.7,  117.0,  115.0,  107.0 
105.0,  99.0,  96.8,  70.7,  70.1,  69.4,  68.1,  61.6  (d,  1C),  28.0,  26.0.  19F  NMR  (DMSO-d6,  ppm)  5  - 

126.7,  -  127.2,  -  127.9,  -  128.5,  -  129.4,  -  130.0,  -  130.7,  -  131.3,  -  132.1.  31P  NMR  (DMSO- 
de,  ppm)  8  25.5.  IR  (KBr,  cm ')  3465,  3098,  2877,  1604,  1482,  1271,  1158,  1031,988,961,792. 
Polymer  8c:  87  %  yield;  ‘H  NMR  (DMSO-d6,  ppm)  5  6.98  (s,  3H),  3.96-4.26  (s,  2H),  3.26-3.78 
(d,  12H),  1.82-1.93  (m,  2H).  13C  NMR  (DMSO-d6,  ppm)  5  161.2,  156.7,  117.0,  115.0,  107.0 
105.0,  99.0,  96.8,  70.7,  70.1,  69.4,  68.1,  61.6  (d,  1C),  28.0,  26.0.  19F  NMR  (DMSO-d6,  ppm)  5  - 

126.7,  -  127.2,  -127.9,  -  128.5,  -  129.4,  -  130.0,  -  130.7,  -  131.3,  -  132.1.  31P  NMR  (DMSO- 
d6,  ppm)  5  25.6.  IR  (KBr,  cm ')  3465,  3098,  2877,  1604,  1482,  1271,  1158,  1031,988,961,792. 


Synthesis  and  Characterization  of  Perfluorocyclobutyl  Aromatic  Polyethers  Bearing 
1H, 2, 3-Triazole  Rings  on  Oligo(ethylene  oxide)s  Chains 
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Scheme  4.  Synthesis  of  polymer  7  and  8. 

General  procedure  for  synthesis  of  compound  4 

3  (10  mmol)  and  azidomethyl  pivalate  (1.57  g,  10  mmol)  were  suspended  in  THF-ICO  (20  lnL, 
1:1).  To  this  mixture  was  added  CuS04-5H20  (0.125  g,  0.5  mmol)  and  sodium  ascorbate  (0.594 


g,  3  mmol).  The  mixture  was  stirred  at  room  temperature  for  48  h.  The  reaction  mixture  was 
diluted  with  H2O  (20  mL),  and  extracted  with  EtOAc  (2  x  20  mL).  The  combined  organic  layers 
were  washed  with  brine  (2x10  mL),  dried  over  anhydrous  MgSCL,  filtered,  and  the  solvent  was 
removed  under  reduced  pressure.  The  residue  was  purified  by  column  chromatography  (Hexane  - 
EtOAc,  1:3)  to  give  a  colorless  oil  (4). 

Compound  4a:  96%  yield;  *H  NMR  (CDC13,  ppm)  5  7.66  (s,  1H),  7.53  (d,  2H),  7.11  (d,  2H), 
6.01  (s,  2H),  4.43  (s,  2H),  3.90  (t,  2H),  3.33-3.45  (m,  6  H),  2.19  (s,  3H),  0.93  (s,  9H).  13C  NMR 
(67  MHz,  CDCI3)  5  177.3,  145.3,  144.8,  132.8,  129.8,  127.8,  124.2,  70.5,  70.4,  70.3,  69.6,  69.4, 

68.5,  64.2,  38.6,  26.7,  21.5.  IR  (neat,  cm’1)  3148,  2972,  2873,  1745,  1599,  1454,  1359,  1284, 

1179. 1137. 1038. 923. 814,  775, 666. 

Compound  4b:  93%  yield;  !H  NMR  (CDC13,  ppm)  5  7.67  (s,  1H),  7.53  (d,  2H),  7.11  (d,  2H), 
6.01  (s,  2H),  4.43  (s,  2H),  3.90  (t,  2H),  3.33-3.45  (m,  10H),  2.19  (s,  3H),  0.93  (s,  9H).  13C  NMR 
(67  MHz,  CDCI3)  5  177.3,  145.3,  144.8,  132.8,  129.8,  127.8,  124.2,  70.5,  70.4,  70.3,  69.6,  69.4, 

68.5,  64.2,  38.6,  26.7,  21.5.  IR  (neat,  cm’1)  3146,  2971,  2874,  1744,  1598,  1455,  1358,  1282, 

1179. 1137. 1035. 921. 814,  772, 664. 

Compound  4c:  92%  yield;  *H  NMR  (CDCI3,  ppm)  5  7.65  (s,  1H),  7.53  (d,  2H),  7.11  (d,  2H),  6.01 
(s,  2H),  4.43  (s,  2H),  3.90  (t,  2H),  3.33-3.45  (m,  14  H),  2.19  (s,  3H),  0.93  (s,  9H).  13C  NMR  (67 
MHz,  CDCI3)  5  177.3,  145.3,  144.8,  132.8,  129.8,  127.8,  124.2,  70.5,  70.4,  70.3,  69.6,  69.4, 

68.5,  64.2,  38.6,  26.7,  21.5.  IR  (neat,  cm’1)  3144,  2971,  2870,  1743,  1598,  1455,  1357,  1282, 
1176, 1134, 1038, 920, 816, 772, 663. 

General  procedure  for  synthesis  of  compound  5 
A  solution  of  2  (4.84  g,  0.01  mol),  4  (0.01  mol)  and  K2CO3  (2.76  g,  0.02  mol)  in  CH3CN  (50 
mL)  was  stirred  at  reflux  for  48  h.  After  solvent  was  removed,  the  residue  was  partitioned  in 
water  and  ethyl  acetate.  The  organic  layer  was  separated  and  the  aqueous  layer  was  extracted 
with  ethyl  acetate  (2x10  mL).  The  combined  organic  layers  were  washed  with  H2O  and  brine, 
dried  over  anhydrous  sodium  sulfate,  and  evaporated  under  reduced  pressure.  The  residue  was 
purified  by  column  chromatography  (Hexane-EtOAc,  2:7)  to  give  a  colorless  oil  (5). 

Compound  5a:  89%  yield;  !H  NMR  (CDC13,  ppm)  5  7.78  (s,  1H),  6.72  (d,  3H),  6.07  (s,  2H),  4.65 
(s,  2H),  4.08M.15  (m,  2H),  3.76-3.85  (m,  2H),  3.64-3.68  (m,  8H),  1.13  (s,  9H).  13C  NMR  (67 
MHz,  CDCI3)  5  177.0,  160.4,  149.7,  147.3,  135.5,  115.8  (tt,  1C),  113.4  (tt,  1C),  107.7,  106.9, 
74.0,  71.0,  70.7,  70.6,  70.0,  69.4,  68.3,  64.4,  38.9,  26.9.  19F  NMR  (CDCI3,  ppm)  5  -  67.9  (t,  IF), 

-  86.0  (t,  IF).  IR  (neat,  cm'1)  3105,  2980,  2875,  1745,  1609,  1473,  1325,  1277,  1105,  989,  925, 
841,801. 

Compound  5b:  86%  yield;  *H  NMR  (CDC13,  ppm)  5  7.78  (s,  1H),  6.72  (d,  3H),  6.07  (s,  2H),  4.65 
(s,  2H),  4.08M.15  (m,  2H),  3.76-3.85  (m,  2H),  3.64-3.68  (m,  8H),  1.13  (s,  9H).  13C  NMR  (67 
MHz,  CDCI3)  5  177.0,  160.4,  149.7,  147.3,  135.5,  115.8  (tt,  1C),  113.4  (tt,  1C),  107.7,  106.9, 
74.0,  71.0,  70.7,  70.6,  70.0,  69.4,  68.3,  64.4,  38.9,  26.9.  19F  NMR  (CDC13,  ppm)  5  -  67.9  (t,  IF), 

-  86.0  (t,  IF).  5  21.3.  IR  (neat,  cm'1)  3103,  2965,  2877,  1746,  1610,  1462,  1325,  1286,  1103, 
989,  925,  866,  794. 

Compound  5c:  87%  yield;  'H  NMR  (CDC13,  ppm)  5  7.78  (s,  1H),  6.72  (d,  3H),  6.07  (s,  2H),  4.65 
(s,  2H),  4.08^1.15  (m,  2H),  3.76-3.85  (m,  2H),  3.64-3.68  (m,  8H),  1.13  (s,  9H).  13C  NMR  (67 
MHz,  CDCI3)  5  177.0,  160.4,  149.7,  147.3,  135.5,  115.8  (tt,  1C),  113.4  (tt,  1C),  107.7,  106.9, 
74.0,  71.0,  70.7,  70.6,  70.0,  69.4,  68.3,  64.4,  38.9,  26.9.  19F  NMR  (CDC13,  ppm)  5  -  67.9  (t,  IF), 

-  86.0  (t,  IF).  IR  (neat,  cm'1)  3104,  2983,  2875,  1742,  1606,  1476,  1323,  1279,  1106,  988,  923, 
842, 800. 


General  procedure  for  synthesis  of  compound  6 

A  solution  of  5  (10  mmol)  and  Zn  (1.95  g,  30  mmol)  in  CH3CN  (30  inL)  was  stirred  at  reflux 
for  48  h.  After  solvent  was  removed,  the  residue  was  diluted  with  ethyl  acetate  and  filtered.  The 
organic  layer  was  washed  with  H2O  and  brine,  dried  over  anhydrous  sodium  sulfate,  and 
evaporated  under  reduced  pressure.  The  residue  was  purified  by  column  chromatography 
(Hexane-EtOAc,  1:3)  to  give  a  colorless  oil  (6). 

Compound  6a:  77%  yield;  !H  NMR  (CDCI3,  ppm)  5  7.77  (s,  1H),  6.44  (d,  3H),  6.20  (s,  2H),  4.65 
(s,  2H),  4.06-4.11  (m,  2H),  3.79-3.85  (m,  2H),  3.62-3.67  (m,  8H),  1.15  (s,  9H).  13C  NMR  (67 
MHz,  CDCI3)  5  177.0,  160.3,  156.7,  147.2,  146.5  (ddd,  1C),  135.5,  134.6  (tt,  1C),  99.1,  96.9, 
74.0,  71.0,  70.7,  70.6,  70.0,  69.4,  68.3,  64.4,  38.9,  26.9.  19F  NMR  (CDC13,  ppm)  5  -  119.0  (dd, 
IF),  -  125.7  (dd,  IF),  -  134.6  (dd,  IF).  IR  (neat,  cm'1)  3145,  3098,  2983,  2878,  1834,  1746, 
1607,  1474,  1278,  1107,  988,  840,  800. 

Compound  6b:  74%  yield;  [H  NMR  (CDCI3,  ppm)  8  7.77  (s,  1H),  6.42  (d,  3H),  6.21  (s,  2H),  4.64 
(s,  2H),  4.06-4.11  (m,  2H),  3.79-3.85  (m,  2H),  3.62-3.67  (m,  8H),  1.15  (s,  9H).  13C  NMR  (67 
MHz,  CDCb)  5  177.0,  160.3,  156.7,  147.2,  146.5  (ddd,  1C),  135.5,  134.6  (tt,  1C),  99.1,  96.9, 

74.2,  70.9,  70.7,  70.6,  69.9,  69.5,  68.3,  64.5,  38.8,  26.8.  19F  NMR  (CDC13,  ppm)  5  -  119.0  (dd, 
IF),  -  125.7  (dd,  IF),  -  134.6  (dd,  IF).  IR  (neat,  cm'1)  3142,  3095,  2985,  2875,  1833,  1745, 
1609,  1473,  1276,  1105,  989, 841, 801. 

Compound  6c:  72%  yield;  ’H  NMR  (CDC13,  ppm)  5  7.77  (s,  1H),  6.42  (d,  3H),  6.17  (s,  2H),  4.63 
(s,  2H),  4.06-4.11  (m,  2H),  3.79-3.85  (m,  2H),  3.62-3.67  (m,  8H),  1.13  (s,  9H).  13C  NMR  (67 
MHz,  CDCI3)  5  177.0,  160.3,  156.7,  147.2,  146.5  (ddd,  1C),  135.5,  134.6  (tt,  1C),  99.1,  96.9, 
70.9,  70.7,  70.6,  69.9,  69.5,  68.3,  64.5,  38.8,  26.8.  19F  NMR  (CDC13,  ppm)  5  -  119.0  (dd,  IF),  - 
125.7  (dd,  IF),  -  134.6  (dd,  IF).  IR  (neat,  cm'1)  3140,  3092,  2986,  2872,  1832,  1743,  1606, 
1475,  1274,  1103,987,841,802. 


General  procedure  for  synthesis  of polymer  7 

Compound  2  (0.345  g,  0.56  mmol)  was  added  to  a  pre-dried  50  mL  flask.  The  flask  was  pumped 
and  fdled  back  with  nitrogen  for  3  times.  Then  the  reaction  mixture  was  stirred  at  170  °C  for  16  h 
and  190  °C  for  4  h.  After  cooling  to  RT,  the  residue  was  dissolved  in  CHCI3  (10  mL).  The 
resulting  solution  was  precipitate  in  hexane  to  give  polymer  7. 

Polymer  7a:  95  %  yield;  !H  NMR  (CDC13,  ppm)  5  7.66  (s,  1H),  6.58  (d,  3H),  6.20  (s,  2H),  4.62 
(s,  2H),  4.03  (s,  2H),  3.56-3.86  (d,  6H),  1.13  (s,  9H).  13C  NMR  (CDC13,  ppm)  5  177.0,  160.3, 

156.7,  147.2,  146.5  (ddd,  1C),  135.5,  134.6  (tt,  1C),  99.1,  96.9,  74.2,  70.9,  70.7,  70.6,  69.9,  69.5, 

68.3,  64.5,  38.8,  26.8.  19F  NMR  (CDC13,  ppm)  5  -  126.7,  -  127.2,  -  127.9,  -  128.5,  -  129.4,  - 
130.0,  -  130.7,  -  131.3,  -  132.1.  IR  (KBr,  cm-1)  3114,  2973,  2885,  1744,  1608,  1468,  1306, 
1207,  1126,  1018,914,  841,736. 

Polymer  7b:  93  %  yield;  *H  NMR  (CDCI3,  ppm)  5  7.67  (s,  1H),  6.58  (d,  3H),  6.20  (s,  2H),  4.63 
(s,  2H),  4.05  (s,  2H),  3.52-3.91  (d,  10H),  1.14  (s,  9H).  13C  NMR  (CDC13,  ppm)  5  177.0,  160.7, 

153.7,  147.3,  135.5,  102.3,  101.3,  74.0,  70.9,  70.7,  70.6,  70.0,  69.4,  68.1,  64.4,  38.9,  26.9.  19F 
NMR  (CDCI3,  ppm)  5  -  126.6,  -  127.1,  -  127.8,  -  128.4,  -  129.3,  -  130.0,  -  130.6,  -  131.3,  - 
132.1.  IR  (KBr,  cm-1)  2971,2878,  1746,  1612,  1467,  1306,  1205,  1126,  1014. 

Polymer  7c:  92  %  yield;  *H  NMR  (CDC13,  ppm)  5  7.67  (s,  1H),  6.55  (d,  3H),  6.19  (s,  2H),  4.60 
(s,  2H),  4.03  (s,  2H),  3.54-3.89  (d,  14H),  1.14  (s,  9H).  13C  NMR  (CDC13,  ppm)  5  177.0,  160.3, 

156.7,  147.2,  146.5  (ddd,  1C),  135.5,  134.6  (tt,  1C),  99.1,  96.9,  74.2,  70.9,  70.7,  70.6,  69.9,  69.5, 

68.3,  64.5,  38.8,  26.8.  19F  NMR  (CDCI3,  ppm)  5  -  126.7,  -  127.2,  -  127.9,  -  128.5,  -  129.4,  - 


130.0,  -  130.7,  -  131.3,  -  132.1.  IR  (KBr,  cm-1)  3110,  2968,  2878,  1743,  1606,  1465,  1306, 
1202,  1123,  1016,913,840,  731. 


General  procedure  for  synthesis  of polymer  8 

The  flask  was  cooled  to  room  temperature.  MeOH  (4  iuL)  and  1  M  NaOH  (2  mL)  were 
added.  The  resulting  mixture  was  stirred  at  room  temperature  for  12  h  and  subsequently 
neutralized  with  1  M  HCL  and  diluted  with  H2O  (4  mL).  The  liquid  was  removed  and  the  residue 
was  washed  with  water,  dried  under  vacuum  overnight  to  give  a  brown  solid  (8). 

Polymer  8a:  86  %  yield;  *H  NMR  (DMSO-d6,  ppm)  !H  NMR  (DMSO-d6,  ppm)  5  7.76  (s,  1H), 
6.65  (d,  3H),  4.53  (s,  2H),  4.10  (s,  2H),  3.53-3.69  (m,  6H).  13C  NMR  (DMSO-d6,  ppm)  5  .  19F 
NMR  (DMSO-d6,  ppm)  5  -  126.7,  -  127.2,  -  127.9,  -  128.5,  -  129.4,  -  130.0,  -  130.7,  -  131.3,  - 
132.1.  IR  (KBr,  cm-1)  3414,  2885,  1618,  1468,  1306,  1213,  1126,  1008,931,841,716. 

Polymer  8b:  88  %  yield;  'H  NMR  (DMSO-d6,  ppm)  5  7.70  (s,  1H),  6.61  (d,  3H),  4.48  (s,  2H), 
3.38-4.15  (d,  12H).  13C  NMR  (DMSO-d6,  ppm)  5  161.2,  153.5,  146.0,  134.7,  103.0,  100.0  76.2, 
70.3,  70.1,  69.7,  69.5,  69.0,  63.7.  19F  NMR  (DMSO-d6,  ppm)  5  -  126.7,  -  127.2,  -  127.9,  - 
128.5,-  129.4,-  130.0,-  130.7,-  131.3,-  132.1.  IR  (KBr,  cm1)  3423,2880,  1612,  1465,  1306, 
1205,  1128,  1013,947. 

Polymer  8c:  82  %  yield;  'H  NMR  (DMSO-d6,  ppm)  5  7.75  (s,  1H),  6.66  (d,  3H),  4.52  (s,  2H), 
4.08  (s,  2H),  3.51-3.71  (m,  14H).  13C  NMR  (DMSO-d6,  ppm)  5  .  19F  NMR  (DMSO-d6,  ppm)  5  - 
126.7,  -  127.2,  -  127.9,  -  128.5,  -  129.4,  -  130.0,  -  130.7,  -  131.3,  -  132.1.  IR  (KBr,  cm1) 
3410,  2878,  1616,  1465,  1308,  1209,  1 120,  1003,  930,  840,  711. 

Synthesis  of  polymer  7  and  8 

The  preparation  procedure  of  polymer  7  and  8  was  shown  in  Scheme  2.  Polymer  7  was  prepared 
by  the  thermal  [2k  +  2tt]  cyclopolymerization  of  6  in  bulk.  This  polymerization  process  did  not 
require  catalysts  or  initiators  and  gave  polymer  7  in  excellent  yield.  The  formation  of  polymer  7 
was  confirmed  by  1 H,  13C,  19F  NMR  and  FT-IR  spectra. 

Strategy  3:  Copolymers  with  hydrophobic  perfluoro  aromatic  ether-based  backbones  and 
highly  sulfonated  group 

We  have  synthesized  side  chains  for  forming  well-ordered  nanostructure.  The  side  chain  without 
sodium  sulfuric  acid  shows  liquid  crystalline  property.  Generally  cyanobiphenol  shows  a  nematic 
mesophase  itself.  Even  if  it  has  a  flexible  alkyl  chain  to  hydroxyl  group,  it  exhibits  the  same 
mesophase.  According  to  these  results,  6-bromohexanoxybiphenyl  would  show  mesophase-like 
nematic  phase.  When  cyanophenyl  mesogen  was  attached  to  the  polymer  backbone  like 
polyoxetane,  the  mesophase  changed  from  nematic  to  smectic  phase.  The  formation  of  smectic 
phase  will  improve  the  thermal  stability  of  the  mesophase  rather  than  nematic  phase  so  the 
stability  of  liquid  crystalline  polymer  will  be  more  stable  in  mesophase.  We  are  expecting  the 
sulfonated  biphenyl  moiety  for  having  liquid  crystalline  property.  We  are  ready  to  synthesize  the 
sulfonamide  groups  on  the  polymer  to  check  if  there  is  liquid  crystalline  property. 


Current  and  Future  Work 

Fuel  cell  tests  with  the  membranes  prepared  utilizing  the  three  strategies  will  continue  as 
membrane  castings  are  done.  Tests  and  characterization  work  (proton  conductivity  testing, 
thermo-gravimetric  analysis,  fuel  cell  performance,  AFM,  SEM,  etc.)  of  the  membranes  prepared 
will  also  be  undertaken.  Further  optimization  work  will  also  be  included  along  with  the  tests  to 
meet  DOE  guidelines. 
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